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Abstract 

We study the back reaction of cosmological perturbations on the 
evolution of the universe. The object usually employed to describe 
the back reaction of perturbations is called the effective energy- 
momentum tensor (EEMT) of cosmological perturbations. In this 
formulation, the problem of the gauge dependence of the EEMT 
must be tackled. We advance beyond traditional results that involve 
only high frequency perturbations in vacuo JjJ], and formulate the 
back reaction problem in a gauge invariant manner for completely 
generic perturbations. We give a quick proof that the EEMT for 
high-frequency perturbations is gauge invariant which greatly sim- 
plifies the pioneering approach by Isaacson Q. As applications we 
analyze the back reaction of gravitational waves and scalar met- 
ric fluctuations in Friedmann-Robertson- Walker background space- 
times. We investigate in particular back reaction effects during infla- 
tion in the Chaotic scenario. Fluctuations with a wavelength much 
bigger than the Hubble radius during inflation contribute a negative 
energy density, and in that case back reaction counteracts any pre- 
existing cosmological constant. Finally, we set up the equations of 
motion for the back reaction on the geometry and on the matter, and 
show how they are perfectly consistent with the Bianchi identities 
and the continuity equations. 



Work based on the Ph.D. thesis by the author, Brown University (1997). 
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Chapter 1 

Introduction 



It is a well-known fact in oceanography that waves in the sea carry some amount 
of energy, which is released in the form of thermal energy as the waves crash 
on continental shorelines. This energy is responsible for a small fraction of the 
total thermal energy stored in the oceans' water. Since waves pass through 
each other without much dispersion, the problem of propagation of sea waves 
can be treated linearly, and in fact oceanography analyses the several different 
classes of waves (of different amplitudes and wavelengths) independently from 
each other. Waves "average to zero" in the sense that the overall sea level, for a 
constant amount of water, is set at meters. Nevertheless, the average energy 
stored in waves is not zero, since it is not a linear but really a quadratic function 
of the wave. This second order phenomenon is the simplest case of a nonlinear 
feedback effect, or back reaction, in fluid dynamics. 

A close analogy can be drawn between sea waves (called, ironically, "waves 
of gravity" in oceanographic jargon) and the coupled matter and metric per- 
turbations (gravitational perturbations for short) with which we will concern 
ourselves here. The sea level and the temperature of the sea water correspond, 
respectively, to the background geometry upon which the waves (gravitational 
perturbations) propagate, and their energy density; the average of a gravita- 
tional perturbation is zero as well; the problem of propagation of perturbations 
can be treated in linear theory as a first approximation, although the exact 
problem, like fluid dynamics, is fundamentally nonlinear; the energy stored in 
the gravitational perturbations is, similarly, a quadratic function of the per- 
turbations; finally, metric and density perturbations can effect the background 
geometry and energy density in which they propagate. 

The theory of General Relativity (2[ ||, ||, used here to describe gravitational 
systems, is much more complex than fluid dynamics in that one of its key ele- 
ments is covariance under general gauge transformations that change the coor- 
dinate frames of observers in the manifold spacetime. Although this constraint 
on Einstein's equations can be dealt with in exact problems containing sym- 
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metries and even in the linear theory of perturbations, to second order (when 
the quadratic effects associated with the energy carried by the waves appear), 
a formalism appropriate to the situation at hand has been sorely lacking in the 
literature. Consequently, many difficulties arise in the treatment of problems 
related to back reaction such as long calculations, lack of well-defined physical 
observables and, gravest of all: gauge dependence of the source of back reaction 
effects (which would correspond to the sea temperature), called the effective 
energy-momentum tensor of gravitational perturbations. 

We separate gravitational perturbations in two broad classes: fluctuations 
which are of short wavelength and high frequency (HF), and the ones which 
have long wavelength and low frequency (LF). Although the problems cited 
above have been partially solved in the case of HF perturbations |Q, we will see 
that, quite surprisingly, the decisive contribution from back reaction in at least 
one case (the Chaotic Inflationary scenario Q) comes actually from the soft LF 
perturbations, and not from the hard HF ones. 

We will show how a crucial ingredient in solving these problems is the fact 
that even linear gauge transformations (i.e., transformations that simply rede- 
fine the perturbations of matter and metric) can change the characteristics of 
the background geometry, as well as its dynamics. With that element in hand, 
we will show how it is possible to construct physical observables (gauge invari- 
ant quantities) which measure the effects of the back reaction of gravitational 
perturbations. 

We apply this formalism to the problems of back reaction of gravitational 
waves (metric perturbations which are decoupled from matter sources) and 
scalar perturbations (coupled matter and metric fluctuations) in homogeneous 
and isotropic Friedmann-Robertson- Walker (FRW) background geometries. One 
of the results is that, due to back reaction, the equation of state of long wave- 
length gravity waves suffers corrections that have not been included in previous 
treatments. 

The main result of our analysis comes from an application to the case of 
scalar perturbations during inflation in the chaotic model, where we have shown 
that back reaction can reduce the speed and the duration of inflation. The power 
of the feedback effects comes not from the amplitude of the perturbations, which 
is always small, but from the fact that the expansion of the universe redshifts 
the physical wavelengths of perturbations causing an exponential growth of the 
population of soft infrared (IR) modes. 

As a spinoff of these ideas we suggest that back reaction might provide a 
dynamical relaxation mechanism whereby a preexisting cosmological constant 
is screened. 

This work is organized as follows: in Chapter 2 we treat the problem of 
propagation of coupled metric and matter perturbations in FRW spacctimes, 
with special care given to the simple but interesting case of scalar field matter. 
Chapter 3 deals with the Chaotic Inflationary scenario, and with the process of 
generation of cosmological perturbations by quantum fluctuations of the scalar 
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field. These introductory chapters can be skipped if the reader is familiar with 
modern cosmology. 

We give a brief historical account of early as well as recent attempts to 
tackle the back reaction of gravitational perturbations in Chapter 4, and show 
how the problem in the context of HF perturbations can be formulated in a 
simpler way than before. In Chapter 5 we discuss the theory of finite gauge 
transformations, which we argue is fundamental to an understanding of some 
conceptual as well as practical issues related to back reaction. We also provide 
a covariant formulation of back reaction in generic backgrounds. Finally, in 
chapters 6 and 7 we develop applications to the case of gravity waves and scalar 
perturbations. 
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Chapter 2 

The Theory of Cosmological 
Perturbations 

Recent years have seen a confirmation of the basic working hypotheses of Cos- 
mology: 1. that the universe is approximately homogeneous, with small pertur- 
bations superimposed onto this smooth background, and 2. that the universe 
is expanding, its energy density falling, and the temperature of the primordial 
background radiation is decreasing. The COBE satellite^ and a series of sim- 
ilar experiments fij have determined this fact beyond any dispute, and now it 
is finally clear that the plasma that predominated in the universe up to some 
300,000 years after the Big Bang was extremely homogeneous, in fact its tem- 
perature did not vary more than one part in 10 5 . 

The structures on large scales that we see with the help of telescopes today 
(planets, clouds, galaxies, clusters of galaxies and so on) are but the result of 
a few billion-years of clumping of these tiny inhomogeneities in the plasma. 
The theory of the origin and evolution of these cosmological perturbations has 
become a cornerstone of modern cosmology, since by virtue of the theory of 
cosmological perturbations it is finally becoming possible to falsify models of the 
evolution of the early universe, as well as to test some of the underlying tenets 
of relativistic cosmology. The observation of the cosmic background radiation 
anisotropies, together with the theory of their origin and evolution, provides the 
matrix on which the basic parameters of cosmology (expansion rate, density, 
ratio of baryonic matter to total matter, deacceleration parameter and so on) 
have to be fitted. 

Having established the importance of the study of perturbations in cosmol- 
ogy, it is to this we now turn. 
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2.1 Gravitational instability 



Soon after publishing his seminal paper on General Relativity, Einstein used 
his theory to try and describe the Cosmos. It is important to notice that by 
1917, when he published his "Kosmologische Betrachtungen zur allgemeinen 
Relativitatstheorie" || , E. Hubble had not yet established Slipher's suggestion 
that distant galaxies are receding, and therefore as far as anyone was concerned 
the Universe was static. Einstein soon realized that his theory would not lead to 
such a static Universe, and in order to achieve that he introduced a "universal 
constant" , now widely known as the cosmological constant A. In the ensuing 
model, a closed universe is kept static by the repulsive gravitational force coming 
from a fine-tuned cosmological constant. It just turns out that this model is 
unstable: it would expand (collapse) at an accelerated rate if the matter was 
slightly less (more) dense than some critical density. 

The recent history of large-scale structure in the Universe is dominated by a 
similar type of gravitational instability: galaxies, for example, formed from gas 
clouds that collapsed by gravitational attraction around the galactic center. By 
the same process, smaller (stars, planets) and bigger (clusters and superclusters) 
structures also formed from this coarsening of a quasi-homogeneous expanding 
fluid. An evolution scenario that starts by forming the largest structures first is 
known as "top-down" , while one in which the small structures form first is known 
as "bottom-up" . The intermediate picture, of "scale-invariant" evolution, is the 
one that is currently favored by experimental data. 

In the same manner that an overdensity is unstable to gravitational attrac- 
tion and can grow to become a galaxy, for example, underdensities are also 
unstable and grow with time to form voids, regions with a much lower den- 
sity of galaxies. In that case it is useful to think of an "effective gravitational 
repulsion" || acting on the matter in the boundaries between the voids, and 
pushing them together into pancakes. It is not clear which are the structures 
that dominate the Universe at the present time, but it has been argued that in 
the scale of clusters of galaxies, voids are in fact more pervasive than clusters^ . 

Perturbations in the homogeneous fluid of a static Universe, as often happens 
with instabilities in Partial Differential Equations, grow in amplitude exponen- 
tially (their physical size of course grows together with the expansion of the 
Universe). If the expansion rate of the Universe is a power-law in time, i.e., if 
the physical scales R ~ t m , the expansion counteracts the gravitational attrac- 
tion and the perturbations' amplitudes now grow only as a power-law: Sq ~ t n . 
Finally, if the universe is expanding exponentially (as it does if a cosmological 
constant dominates the evolution), perturbations are asymptotically static, that 
is, their amplitude tends to a constant, [j] 

1 We should note that in fact there are always two solutions for the perturbations, one 
growing, the other decaying, and the remarks above apply to the dominant (growing) modes. 
Neg lect of the decaying modes is usually justified, but sometimes can lead to crucial mistakes 

mm- 
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Since the Universe at the present time seems to be expanding as a power- 
law, we conclude that it is also slowly becoming more and more complex, with 
growing voids and clusters of galaxies. But if there was a time when the Uni- 
verse inflated exponentially, then perturbations with large enough wavelengths 
A > Rh would have been frozen during this period. If by some mechanism 
perturbations of that scale were created with the same amplitude during an in- 
flationary phase, once that phase was over and a power-law expansion subsided 
the perturbations could then grow and start forming stars, galaxies, voids, etc. 
The spectrum of perturbations, that is, the amplitude of each mode with phys- 
ical wavelength A, would then be "scale-invariant" , which is consistent with the 
data available now. 

In mathematical language, we will solve Einstein's theory linearized around 
an expanding homogeneous background geometry || [| |J. A crucial issue is 
related to the freedom of gauge, that is, the symmetry under general coordinate 
transformations. It turns out that it is possible to build variables which have the 
same value in all coordinate frames, i.e., gauge- invariant variables, and those are 
of course the physically meaningful quantities. In the next sections we consider 
this theory in more detail. 

2.2 Relativistic cosmology: background models 

Clearly a Newtonian approximation to the description of the Universe in its 
largest scale is insufficient: since the expansion rate is now 65± 15kms~ 1 Mpc~ 1 
(see Q for an account of the latest measurements), galaxies at a distance of 
lOOQMpc from the Milky Way, for example, are receding from us at 20% of the 
velocity of light. The expansion rate gets even bigger as we look back into the 
past, thus only Einstein's relativistic theory is capable of providing an accurate 
description. 

In a perfectly homogeneous and isotropic Universe, there are no preferred 
directions or axes in space, and the geometry, the metric and all other physical 
parameters depends only on time. We write the line element in this "cosmic 
time" as 

ds 2 = dt 2 - a 2 (t)dx 2 , (2.1) 

where a(t) is the scale factor. The spatial coordinates x are called "comoving 
coordinates" , and are related to physical distances by r — a(t)x , hence the name 
"scale factor" . If we could sprinkle dots over the universe, the physical distances 
between dots would increase if the Universe was expanding and decrease if 
the Universe happened to be contracting, but the comoving distances between 
dots would nevertheless remain always the same. Eq. ( |2.1[ ) is known as the 
Friedmann-Robertson- Walker (FRW) metric on flat space (we will not discuss 
here the trivial but cumbersome cases of open and closed Universes) 



7 



It will be practical, in certain applications, to use a different clock or time- 
slicing, where the time variable (called "conformal time") is di] — dt/a(t) . In 
this case we can write the line clement as 



ds 2 = a 2 (77) (dry 2 - dx 2 ) 



(2.2) 



The time derivative of a function / with respect to conformal time, /', can be 
related to the derivative with respect to cosmic time, /, using the rule 



drj J y ' dV 



(2.3) 



The reader can freely exchange between the cosmic and conformal time pictures 
at any time. 

The equations determining the kinematics and the dynamics of matter in 
General Relativity are the Einstein Field Equations^, 
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(2.4) 

where is the Einstein tensor (which expresses a curvature of spacetime) and 
T^y is the energy-momentum tensor (EMT) of matter. 

We will later consider more extensively the case of scalar field matter, where 
the EMT is given by 



(2.5) 



where V(ip) is the potential of the scalar field (p. 

For now, let us examine the more common cases of hydrodynamical fluids 
such as pressureless matter (dust) and radiation, whose EMTs can be expressed 
as 
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(2.6) 



where p(t) is the energy density, and the pressure p(t) is related to the energy 
density by the equation of state p = wp with w constant. For matter we have 
w c i = 0, while for radiation w r = 1/3. It is useful to note that when the 
dominant component of the Universe is a cosmological constant, the EMT can 
also be written as in ( p.6[ ), with iv\ = — 1. 

In t he homogeneous and is otro pic background of a FRW Universe, with 
metric (2.2), the field equations (2.4) are diagonal and the time-time and space- 
space parts read, respectively, 

2 We use units where 8irG = M~, 2 = 1 . 
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3H 2 = p 
- 2,H 2 -2H = p 



(2.7) 

(2.8) 



where H = a/a is the Hubble parameter, which measures the expansion rate of 
the universe. The inverse of the Hubble parameter ff _1 has dimensions of time, 
and is in fact a fundamental time scale in cosmology, since it is proportional to 
the age of the Universe - around 14 billion years, give or take a couple of billion 
years. 

The geometric fact expressed by the Bianchi identities, G>ff v = is recip- 
rocated on the rhs of ( |2.4| ) by the continuity equations, T'^ v = (semi-colons 
denote covariant derivative). The continuity equations are expressed, in a FRW 
Universe, by the constraint 



p + 3H(p + p) = 



(2.9) 



It is easy to see that the identity above is just the intcgrability condition that 
must be satisfied in ord er th at Eqs. (2/7) and (2_i) have a solution. We can 
find the first integral of (2.9), which is 



a(t) 



-3(l+iu) 



(2.10) 



Pressureless matter will have an energy density inversely proportional to 
the volume V~ x ~ a -3 , while radiation is not only diluted but also suffers 
from redshift, X p h = X/a(t), thus its energy density is proportional to a~ 4 . The 
temperature in radiation is related to its energy density by statistical mechanics, 
p r ~ T 4 , and thus T r ~ a -1 . A cosmological constant (w\ = — 1) have constant 
energy density. An important consequence of these equations is that, as we 
run the clock backwards (thus decreasing a), the energy density of radiation ( 



a ) increases more rapidly than the one in matter 



) while the energy 



density due to a cosmological constant stays constant. We thus conclude that no 
matter how small the energy density in radiation is right now, it will necessarily 
be dominant in the early Universe. 



Substituting the above expression into Eq. (2.7) yields the solution for the 
scale factor as a function of time, 



if w 7^ — 1 and 



(2.11) 



a(t) 



Mot 



(2.12) 



if w = — 1. In this last case, called de Sitter spacetime, the scale factor grows 
exponentially and the Hubble parameter is a constant, as evident in Eq. ( |2.7[ ) . 
The fact that for non-exotic matter (such as a cosmological constant) the scale 



Figure 2.1: Surface of last scattering (dotted circle) of the cosmic microwave back- 
ground radiation. Photons that emerged from overdense regions after the time of 
decoupling are more redshifted then photons that came from underdense regions. 

factor goes to zero when t — > is an intrinsic feature of General Relativity, and 
this special case of space-like singularity is called the Big Bang. 

At the present time the Universe is dominated by pressureless matter, with 
a fraction (1CT 4 ) of the energy density contained in a background of photons 
which occupy the microwave band with a temperature of 2. 73° K (thus the name 
Cosmic Microwave Background Radiation - CMBR). Approximately 1,000,000 
years after the Big Bang, though, the energy density is still just high enough 
that the temperature of the CMBR can ionize Hydrogen atoms. After this 
moment (sometimes also called recombination), when X^ec ~ 3000° K, matter 
becomes transparent to radiation and they decouple from each other. 

An important consequence of recombination is that photons did not scatter 
from matter after the time of decoupling tdec up to the present time. In other 
words, they traveled in a straight line since this "surface of last scattering" at 
tdec (see Fig. 2.1), and if we can set the temperature of our experiment low 
enough that we can detect those photons, we will be seeing a picture of the last 
events that took place at this time of decoupling. 

That is the content of the map of the CMBR sky produced by the COBE 
satellite. Photons coming from the surface of last scattering have an average 
temperature of 2.736°K, with fluctuations in the quadrupole of about one part 
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in 5 x 10~ 6 , or lOmK. There is also a dipole that was subtracted in the CMBR 
map which is ascribed to a peculiar motion of our galaxy, with respect to this 
"ether" of primordial photons, of about 370 km s . 

Notice that the presence of this background of radiation introduces a class 
of preferred reference frames (coordinates) for observers in the Universe: the 
comoving observers^. It would make little sense to utilize a reference frame 
which is moving with respect to the background photons, since it would intro- 
duce dipoles in the CMBR, in the Hubble diagrams, and indeed in most all-sky 
observations. 

It is important to stress this fact: when talking about cosmology, we are 
going to consider only reference frames (or coordinate frames) which are, within 
a first approximation, static with respect to the cosmic background radiation. 
When we consider gauge transformations, in this context, we will be always 
considering transformations between coordinate frames inside this class of quasi- 
static comoving frames. A precise mathematical formulation of this point will 
be given in Section 5.1. 

Before concluding this section let us examine the important case of scalar 
field matter, which will be crucial in the next sections. If the Universe is filled 
with a homogeneous scalar field <fo(t) with potential energy V(ipo), the EMT is 
diagonal with energy density and pressure given by 



When the potential term dominates, the EMT is identical to the EMT of a 
cosmological constant A = V(ipo). If such a scalar field is dominant over other 
forms of matter, the Universe will expand exponentially - or inflate - like de 
Sitter space, until the kinetic terms become comparable to the potential term. 
If this inflationary period lasts long enough, an apparent event horizon is created 
due to the exponentially fast expansion rate. Two observers in opposite poles 
of a sphere with radius the size of the horizon Rh will not be able to exchange 
signals, because the expansion of the Universe is so fast that those signals cannot 
reach the other poles. Of course, once inflation is over those signals can progress 
towards the other observer once again, but during inflation we can define this 
horizon. Clearly, the faster the expansion rate the smaller the horizon size, 
and in fact, Rh = H . The existence of horizons during periods of quasi- 
exponential expansion will be a welcome feature of inflationary models that we 
explore later on. 

3 Peculiar velocities of galaxies with respect to the comoving frame can indeed be inferred 
from the way in which those galaxies feel the background radiation, and underlies the Sunyaev- 
Zcl'dovich method for measuring velocities of distant galaxies. 



Ps = + V((f ) 

1 2 , . 



(2.13) 



(2.14) 
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In many situations (during the reheating phase after inflation, for example) 
the scalar field is oscillating coherently around the zero of the potential with 
amplitude 2(fo, and then on average the kinetic term equals the potential term, 
implying that p s = and p s = 2V((po). 

The continuity equations in the case of scalar field matter are just the equa- 
tion of motion for (p (the Klein-Gordon equation in a FRW spacetime) and 
read 



<pa + 3H<p + V <p ( V >a)=0 . (2.15) 

In general the first situation described above (potential energy dominance) im- 
plies that the first term in ( 2.15 ) can be neglected. In that case ipo = — V/3H 
and the scalar field is "slowly rolling" down towards the minimum of the poten- 
tial. The second situation (coherent oscillations) usually means that the friction 
term SHifio is negligible. 

We conclude then that if the Universe was initially dominated by a scalar 
field displaced from the zero of its potential, it would inflate exponentially until 
the friction term became small, and then it would expand in a power-law just 
like in a matter-dominated Universe with p s = 0. 



2.3 Perturbations of FRW spacetimes 

The mosaic of hot and cold spots in the COBE map of the CMBR shows that 
some regions of the early Universe were slightly hotter (and therefore denser) 
than others. Since energy curves space, this implies that the metric was also 
largely homogeneous, with some small bumps corresponding to the hot and 
cold spots. Call e the adimensional measure of the magnitude of these inhomo- 
geneities. In the case of our Universe at the time of decoupling, e is, explicitly, 



ST 

^dec — rp 



1(T 5 , (2.16) 



where ST denotes the fluctuations in the temperature field. Metric perturbations 
can be preliminarily thought of as proportional to the variation of the newtonian 
potential, which in turn is proportional to the temperature fluctuations of the 
primordial plasma. The constant e thus regulates the size of both matter and 
metric perturbations, and in fact can even be related to the small parameter 
that enters when we consider gauge transformations (see next section). 

Although the background variables depend only on the coordinate rj (we use 
conformal time here for convenience) the perturbed variables can vary along all 
the 4 coordinates. The scalar field, thus, will be written as 

<p = <po( , n) + eS <p('n,2) > ( 2 - 17 ) 

and the metric as 
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g»u = + ^g^(v, x) (2.18) 

where is given in ( |2.2| ). 

T„=a 2 w(j • (2-19) 

The metric perturbations 8g^ v of a homogeneous and isotropic background 
( |2.19| ) can actually be classified into scalar, vector and tensor, each category 
belonging to different "multiplets" of the group of spatial coordinate trans- 
formations on a fixed time slice (for a review, see Jl3|). Scalar perturbations 
transform between each other when the coordinate transformation can be ex- 
pressed in terms of a 3-scalars; vector perturbations mix with other vector per- 
turbations when the transformation is due to a traceless 3-vector; and tensor 
perturbations (also known as gravitational waves) are unchanged since there is 
no way by which a genuine traceless, divcrgcncclcss 3-tensor can produce a coor- 
dinate transformation. We will concern ourselves mostly with scalar and tensor 
perturbations, since in most applications vector perturbations decay rapidly. 
As would be expected, each perturbative "mode" couple to different sectors of 
matter, in such a way that there is no correlation between the 3 categories, and 
we can analyze them separately^. 

The scalar perturbations can be constructed in terms of 4 scalar quantities, 
</>, ip, B and E: 

«e.-<fM(\ *4 -*,,]) ■ (2 - 20 » 

Vector perturbations can be written in terms of two transverse (or solenoidal) 
3-vectors Si and Fi (if they were not transverse we could extract scalars - the 
divergences S\ and F\ - from them and construct new, transverse vectors) 

Finally, tensor perturbations can be cast in terms of a traceless, transverse 
3-tensor hij. Again, if the constraints h\ — or h 1 ■ i — were not satisfied, we 
could extract a scalar or a vector from hij and then construct a new tensor that 
would be traceless and transverse. The metric perturbations for gravitational 
waves are thus 

hi. - -An) ( I I ) ■ (2.22) 

The accounting for degrees of freedom is the following: 



This is valid not only in linear theory, but even to second order, as we will see later 
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4 d.o.f. for scalar modes (0 constraints); 
4 d.o.f. for vector modes (2 constraints); 
2 d.o.f. for tensor modes (4 constraints), 

giving a total of 10, which is the number of independent components of the 
symmetric tensor Sg^ u . 

The splitting above, into scalar, vector and tensor modes, is specific to the 
FRW background, and would not be valid if the 3-space was not homogeneous 
and isotropic. In the case when the background is a Schwarzschild geometry, 
which is symmetric on two of the variables (the polar and azimuthal angles) 
but not on the radius, perturbations can be separated into similar groups that 
transform according to rotations in S 2 . 

2.4 Diffeomorphism transformations 

General Relativity is a diffeomorphism- invariant theory, i.e., its action and phys- 
ical observables are symmetric under transformations that take any one coordi- 
nate frame continuously into another. A general infinitesimal coordinate trans- 
formation takes a system of coordinates {x^} of a given manifold M. into a 
different coordinate system {x^} in such a way that the structure of the man- 
ifold remains the same. This mathematical definition means that physically 
meaningful observables such as the topology, the existence of singularities or 
the amplitude of the quadrupole anisotropy in the CMBR are notions which are 
independent of the coordinate frame we happen to choose. 

In infinitesimal form, we can write a general gauge transformation as 

Xp ► Xp — Xp ~\~ s^p , (2.23) 

where £ is a small parameter (later e will be identified with the e that measures 
the relative magnitude of perturbations) and £(x) is a general 4- vector^. Here 
we present a simplified version of the treatment given in Chapter 5. For a fuller 
discussion of the issue of perturbations of homogeneous spacetimes, see and 

Now the question is, how do tensors transform under a gauge transformation? 
In fact tensors by definition transform according to the law 

e- (.,) - e: <*,) = (||) p ■ ■ ■ (g) r ■ ■ ■ <%::. M ■ (2.24) 

Notice that on the l.h.s. of this equation the components q are evaluated at the 
coordinate value x p —x p + e£p. Substituting this expression for x into Eq. 

5 Whenever possible we will drop tensorial indices, so here for example £ and x mean clearly 
the 4- vectors £^ and x^. A generic tensor q^... will be denoted only by q. 
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( |2.24 ) and expanding both the left and the right-hand-sides yields the definition 
of the Lie derivative 



q{x P ) = q(x P ) - e£^q{xp) + 0(e 2 ) . (2.25) 

The Lie derivative can be interpreted as being the differential operator that 
transforms tensors without changing the coordinate point at which the tensor 
is evaluated^} We speak then of "Lie dragging" tensors over real points whose 
coordinate values have been fixed, along a line with tangent vector £. The 
difference between tensor components before and after the dragging defines the 
Lie derivative. 

The algebraic definition of the Lie derivative is quite straightforward: it acts 
on 4-scalars as 

C S S = S„f , (2.26) 

on vector fields as 

(£^r = <r-^c - ( 2 - 27 ) 

on 1-forms (covariant vectors) as 

(£ € u) M = u^ a C + u a Q , (2.28) 

and so forth for higher-ranking tensors. The case of the metric tensor g^ v is 
particularly important: 



g»u = g»u - £ OQfl)^ = g^v - sg»u,a£ a - eg av ^ - sg^t.% ■ (2.29) 

A coordinate transformation that leaves the metric unchanged is called an isom- 
etry, and vector fields ( (called Killing vectors) that generate isometrics obey 
the Killing equations^. 

" 9auQ ~ g^C = -Cmu - Cvi„ = • (2.30) 

We want to argue now that, to our purposes, the small parameter e is the 
same as the e introduced before as the measure of the strength of the pertur- 
bations in FRW spacetimes. As we discussed, we will be concerned only with 
coordinate frames which describe the Universe as approximately homogeneous 
and isotropic. This means that the background spacetime is constrained to be 

e This is known as the "passive" interpretation. The equivalent "activ e" picture is more 
geometric and prescinds of any mention to coordinate frames (see, e.g., |jlq, [17))- We will 
discuss in more detail the passive interpretation when we speak of Finite Diffcomorphisms in 
Chapter 5. 

7 It is useful to note that in the Lie derivatives all connections cancel, making the normal 
and the covariant derivatives equivalent in those expressions. 
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described by the metric (2.2) and the homogeneous scalar field ipo(rj), up to a 
trivial time reparametrization. We can still allow, though, for coordinate trans- 
formations that change the form of the matter and metric perturbations Sip and 



Sg^y. Equation (2.29) reads then 



<W = %u + eS 9^=l^u + eSg IM/ -e[L e (j + eSg)] fil/ 

= lnv + eSg^y - e-t^ a £ a - - £7^C + °( ee ) ' ( 2 - 31 ) 

which after subtracting the background yields 

eSgpvix) = eSg^{x)-ej^ a C(x)-£lai>^(x)-£J IJ ,aCv{x)+0{ee) . (2.32) 
The matter perturbations change similarly, and it is easy to see that 

eSip{x)=eSip-£ip, a £ a {x)+0{e£) . (2.33) 

From the equations above we can conclude that if the magnitude of perturba- 
tions is to be preserved under gauge transformations, then e < e. In case of 
equality, the gauge transformation can potentially redefine the perturbations; 
if e <C e the gauge transformations do not alter significantly the perturbations, 
and we can ignore their effect. We will use the saturated bound e = e and drop 
their distinction altogether, since we are interested in gauge transformations 
that are large enough that they actually affect the perturbations. Of course this 
upper bound constrains the class of gauge transformations that are allowed, 
but the price of not going along these lines is giving up a well-defined notion of 
background. In order not to make notation too heavy, we drop any reference 
to e, and without ambiguity assume it implicit in the definitions of £ and the 
perturbations. 

If we separate the metric perturbations into the three categories described 
in the end of the last section (scalar, vector and tensor), gauge transformations 
do not destroy this classification. In a FRW background spacetime any 4-vector 
£(x, rf) can be decomposed in a 3+1 fashion into two 3-scalars and one transverse 
3- vector, 

£ /1 =[£°,tf+&] > ( 2 -34) 
where 77) and £ s (i?, 77) are 3-scalars and Ct(^; 7 ?) i s a 3- vector such that 

e T)i = o. 

Using the general coordinate transformations above, the metric perturba- 
tions Sg^+Sg^+SgT giv en in ( 2.2C )-( 2~2^ ) and the background metric ( 2.19 ) 



and substituting into Eq. (2.32) we find that metric perturbations change in 
the following way (a prime denotes derivative with respect to conformal time, 
a' = da/dvj): for scalar perturbations, 
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(2.42) 

finally, the tensor perturbations (gravity waves), as we noted, do not transform, 

hij — > hij = hij . (2.43) 

On the other hand, matter perturbations are transformed according to their 
character: scalar field perturbations are simplest, 

dip -► 5(p = 5lp - tp' f . (2.44) 

Background 4-vector fields in homogeneous and isotropic spaces are such that 
their spatial components vanish (a non-vanishing component would define 
a preferred orientation and break isotropy), so = [v9 ^(rf), 0). Perturbations 
of vector fields in 3+1 can be decomposed into a transverse 3- vector field plus 
two scalar fields, Sv^ = [Sv°, 5v l T + Sv' 1 ] and their components transform as 

5v° - 5v°=5v -v (0) 'e + v° (0) e' , (2.45) 
Sv -> Sv = Sv + v? Q) f' , and (2.46) 



Sv l T -> 5v l T = Svt + W( 0) Ct • (2-47) 

This separation can be carried out to tensors of arbitrary rank. 

Finally, let us comment on the problem of the "choice of gauge" . The metric 
tensor has 10 components, but due to the diffeomorphism symmetry, only 6 are 
truly dynamical variables. In order to "fix" a gauge we must impose constraints 
on the metric variables. From the definition of the diffeomorphism generator, 



Eq. (2.34), we draw that there are 2 constraints that can be imposed on the 



4 scalar modes, and 2 that can be imposed on the 4 vector modes (the tensor 
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modes are already properly constrained by the TT conditions). There are then 6 
dynamical variables in the metric tensor: 2 scalar, 2 vector and 2 tensor modes. 

We must stress that the constraints on the metric perturbations should com- 
pletely remove the gauge freedom. Some gauge choices fail to do so (the syn- 
chronous gauge defined by the constraints 8go^ — is a typical example), and 
the price to pay for this shortcoming is having to deal with gauge modes that 
grow in time and obscure the calculation. Wc will work later with a gauge choice 
(longitudinal gauge) that eliminates completely the freedom under coordinate 
transformations . 



2.5 Gauge invariant variables 

The transformation laws for scalar, vectors and tensor perturbations show that 
they have distinct expressions in distinct reference frames. Now that we know 
how the gauge symmetry of gravity influences perturbations, we can try to find 
physical observables that can describe those perturbations in an invariant form. 
A quantity is said to be "gauge invariant" if it is independent of £ when the 
reference frame is transformed by x — > x — x + £. Geometric quantities such as 
tensors (scalars included) are covariant, not invariant, and are not included in 
this category. 

However, Bardeen[jl8| was able to construct non-geometric quantities defined 
with respect to a particular coordinate frame, which take the same values in all 
nearby frames. An example of a non-geometric quantity would be the sum of a 
scalar with the — component of a mixed tensor of rank (1,1). 

In this monograph we will follow a slightly distinct but equivalent construc- 
tion than the one carried out in the standard references f0| g [L9|. While 



this treatment is less straightforward, it will prove much more useful when we 
generalize gauge transformations in Chapter 5. 

First, we note that £ is completely general, and that it has the same order 
of magnitude as the perturbations (the factor e that we dropped earlier). We 
can choose its components £° , £ s and Q to be arbitrary functions in a particular 
frame. In particular we could choose them to be linear combinations of the 
components of the metric perturbations, and in fact that is what we should do 
if we would like to go to a frame where, e.g., E — : by Eq. (2.39) we easily 
see that we would have to make a gauge transformation with £ s = E. 

Call X^ [Sg] the special kind of "vectors" defined in terms of the perturba- 
tions such that, under a gauge transformation, 

X^[Sg] -> X»[6g] = X^[5g] + f . (2.48) 

In other words, under a coordinate transformation generated by £, the vectors 
X transforms like the coordinates. What we mean by ( (2.48 ) is that X^ are 
linear functional of the components of the tensor 5g^ v which, under a gauge 
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transformation, change in such a way that X^, written in terms of Sg^, is 
related to by (fO|). 

Suppose now that we found at least one vector X that conforms to our 
specifications. Consider then the following quantities Q„... defined in terms of 
a generic tensor q„. '. '. with the help of X: 

Q = q + C x q . (2.49) 

If q has a perturbative expansion q = qo(rf) + Sq(rj,x), then naturally the back- 
ground is the same, Qo = Qo, but not the perturbations: 

SQ = Sq + Cxqo ■ (2.50) 
The transformation law for SQ can be calculated as follows: 



SQ = Sq + £ x q = Sq- £^0 + Cx+t^Qo 

= Sq + £ x q Q =5Q . (2.51) 

Therefore, the quantities SQ = SQ are gauge invariant variables! It is possible 
then to define gauge invariant quantities related to the perturbations of any 
background variables, provided that we can find a vector X that satisfies the 
transformation law ( |2.4<*| ). 

With the help of the transformation laws for the metric perturbations it is 
easy to extract the components that transform in the desired way. Since there 
is more than one solution for each component X°, X s and X%, we label them 
as follows: 



X° = B-E' , (2.52) 



A (2) - 
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a' 


A (3) - 
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X S(1) = 
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Xs(2) = 


-1 


^T(l) 




VV T(2) — 


-1 
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a 



- j dr,(B - X°) , 



In the definition of X^ the last term can be any linear combination of X?^ 
and X^ with norm 1 (see below). 
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For each one component we can actually use various definitions, as long as 
they sum up to unity. For example, we could define 



X U ~ l X (i) + 2 X (2) > ( 2 - 53 ) 

or 

X sw = 0.78X S(1) + 0.22X S(2) . (2.54) 

There is an infinite number of definitions that we could choose, corresponding 
to an infinite number of ways of defining gauge invariant objects. We would 
prefer the simplest possible definition, that in addition is regular if we choose to 
switch off the expansion of the Universe and is not nonlocal (does not include 
integrals). We will also neglect the vector modes, since they do not play a 
significant role neither in structure formation nor in back reaction, which is the 
object of this work. 

We are left with only one simple choice, 

X° = B-E' , (2.55) 

and 



X s = X = -E (2.56) 

and, of course, X % T = since we are not going to allow for vector modes in 
any reference frame. With these definitions we can compute the explicit expres- 



sions for gauge invariant metric components. Substituting in Eq. (2.50) and 
separating the components we have 



* $ = <j>+-[a(B-E')]' , (2.57) 
a 

ijj * 9 = <f>--(B-E') , (2.58) 
a 

B ^ and 
E * . 



The gauge invariant variable related to the scalar field perturbation is also 
defined in Eq. ( ^50| ), 

Sip * 8tpgi = 6<p + ipo'(B-E') . (2.59) 

While the construction detailed above is undeniably more involved than the 
one usually encountered in the literature, in the end of the day it proves simpler. 
The classical construction consists of looking for gauge invariant combinations of 
components of a given tensor with the metric perturbations. In our construction, 
once the X are properly identified and classified (a trivial procedure), it is a 
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matter of simple algebra to calculate the gauge invariant variables corresponding 
to any tensor. It will also prove much simpler to deal with once we go beyond 
the first order in perturbation theory in Chapter 5. 

So far we did not impose any constraints on the metric, therefore the "gauge" 
is completely general. Since we have excluded the vector modes from the analy- 
ses (2 constraints), we still have to impose 2 conditions on the scalar functions. 
We could choose, for example, the synchronous gauge = and B = 0, but 
these 2 constraints are not sufficient to completely eliminate the gauge freedom 



(see [P-3| ). We will use instead the constraints that make definitions ( 2.55 ) and 
( [2.56; ) easiest to compute, i.e., B = and E = 0, known as longitudinal gauge 
(since the metric perturbations are diagonal) . There are three good reasons for 
this choice of gauge: one is that, as opposed to synchronous gauge, longitudinal 
gauge does not carry any "unphysical modes" , therefore we do not have to deal 
with "ghosts" in the calculation; two, <f> can be connected to the gravitational 
potential in the Newtonian limit, and it is desirable that we can easily take 
that limit if we wish to; finally, in the gauge where B = E = 0, the dynamical 
variables <f> and -0 actually coincide with the gauge invariant variables <I> and iff, 
respectively. 



In fact, with the choice of the vector X as in (2.55) and (2.56), all gauge- 
invariant variables are coincident with their expressions in longitudinal gauge. 
This suggests an elegant way of writing the equations for cosmological per- 
turbations in a completely gauge invariant manner: we derive the equations 
of motion in longitudinal gauge, then substitute 0, ip and 8(p for their gauge 
invariant counterparts $ and 6tp g i. 

Since we are going to use the longitudinal gauge for the remainder of this 
paper whenever we speak of scalar perturbations, we can drop the notational 
distinction between the gauge invariant variables and their corresponding lon- 
gitudinal gauge variables. Thus from now on we restrict ourselves to the scalar 
metric perturbations given by 



It goes without saying that gravity waves do no interfere at all with the discus- 
sion above, since they are gauge invariant to start with. 

2.6 Equations of motion for the perturbations 

The propagation of linearized perturbations in a background spacetime is given 
by the Einstein field equations, expanded in a Taylor series around the back- 
ground metric 7^ and matter fields ifo, truncated at the second order. We will 
use the following notation: 

= Gyp [g] - [g,ip] = n M „ [g, ip] 
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= IL$[ 7 ,<p ] +611^59, 5<p] . (2.61) 

The tensor IljS is zero if the background equations G$ = T^J are satisfied, 
which of course must be solved prior to the linear equations SG^ = 5T^ V . It 
is straightforward but lengthy to derive the linearized equations of motion, and 
it is widely available in the literature [^3). We simply state here the results for 
the cases of gravity waves (tensor perturbations) and scalar perturbations. We 
neglect the vector modes, since they do not appear in most models of structure 
formation. 

Einstein's equations linearized around a FRW background spacetime imply 
the following equations of motion for the tensor modes: 

h'lj + 2Hh' tJ - V 2 /i„ = (2.62) 

where Tt = a' /a and a prime, as before, denotes a der ivativ e with respect to 
conformal time. Notice that the rig ht-hand-side of Eq. J2.62]) does not contain 



any matter source, which is the reason why gravity waves do not couple to 
matter. 

It is useful to make the substitution 

hij&v) = — / -—rj, Mv)^(k)e tk ' x + c.c. , (2.63) 



(77) J (2tt) 3 / 2 



in which case the equations of motion for ii{rf) read simply 

Mfc + ( k 2 - — )nk = . (2.64) 



Due to the constraints on the tensor modes, h\ — hj i — 0, the polarization 



tensor in ( p.63| ) is a symmetric and transverse-traceless (TT) tensor, A = 
Eijk 1 = 0. Consider a gravity wave moving in the z direction, that is, k = 
(0, 0, k z ). The constraints imply that the polarization tensor for this plane wave 
is given in terms of the two variables e xx = —e y y and e xy = e yx . For each 
plane wave there are then 2 degrees of freedom, transverse to the direction of 
propagation of the wave. 

When the background expands in time as a oc r/ p , the equation can be solved 
in terms of (spherical) Bessel functions: 

fi(k, V ) = v 1 ' 2 [C 1 (k)J p - 1/2 (kr))+C 2 (k)Y p _ 1/2 {kr))] . (2.65) 

These solutions can be easily expressed in cosmic time by using a{if)dr] — rpdr\ — 
dt, therefore when p =/= —1 we have r] = t 1 /(J 5+1 ) and a(t) = t p ^ p+1 \ while 
= e Hot and a(t) = e Hot if p = -1. 
Scalar perturbations of the metric are much more complicated since they 
couple to matter perturbations, which appear as a source term in the RHS of 



22 



the linearized Einstein equations. They are also much more interesting, since 
they gave rise to the large-scale structure that we observe. The metric for scalar 



perturbations in longitudinal gauge, given in Eq. (2.6C), can be simplified if we 
take into account that the non-diagonal space-space (i ^ j) components of the 
EMT are identically zero^]. This implies that tp = <j), since the i ^ j components 
of the Einstein tensor are proportional to V,Vj (<f> — ip). With this identification, 
we drop ip and the — 0, — i and i = j equations for the perturbations are, 
respectively^, 

2V 2 <j)-m<j>' -2{H' + 2H 2 )(p = <p' 6<p' + a 2 V v S(p , (2.66) 
24>' + 2H(p = tp' 5tp , (2.67) 
2<t>" + m<j>' + 2{H' + 2H 2 )(p = <p' 6<p' - afV^Sip , (2.68) 

where 7i — a' /a. 

Only two of the three equations above are linearly independent, which can 
be verifyed by using the background identity 

2(H 2 - W) = <^ 2 (2.69) 
and the equations of motion for the scalar field, 

<f% + 2?V + a 2 V v = . (2.70) 



TherhsofEq. fl2~66| ) is in fact just twice the perturbed energy density of the 



scalar field, 8p s . If we switch off the expansion of the Universe {TL, Ti' — > 0) this 
equation reduces to Poisson's equation V 2 = 5p, where the Newtonian gravi- 
tational potential has been replaced by the the relativistic metric perturbation 
0. 

In addition to the Einstein equations, the perturbed scalar field obeys the 
linearized Klein-Gordon equations in a curved FRW background. These can 
be obtained either directly through the continuity equations V M T M „ = 0, or as 
the integrability condition on the system ( 2.66| )-(2.6S), since the first equation 



is just a constraint and the second a true dynamical equation (second order in 
time derivatives). In cither case the equation for the evolution of Sip is 



V' + 2W<V - + 2a 2 V^4> + a 2 y w ^ - V 2 ^ = 



(2.71) 



We can simplify the system even more by using the constraint (2.67) to 
express 5<p in terms of 4>- After some algebra and the use of the background 
equations ( 2.70| ) and ( 2.69| ) we get the surprisingly simple wave equation 

8 This is also true of scalar hydrodynamical perturbations when there is no anisotropic 
stress. 



9 Conversion to cosmic time dt 
f" = a 2 (/ + d/). 



adr] is easily done using the identities /' = af and 
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u" - V 2 u - Mjj(r))u = , (2.72) 
where we introduced the variable 

u(ri,x) = -y (f)(r],x) (2.73) 

and the auxiliary time-dependent "mass" 

Mfo) = ^^f- ■ (2.74) 



The main reason we use conformal time in this section is that Eq. ( 2. 72] ) looks 



very simple. In fact, in this form the problem of evolution of perturbations turns 
out to be just scattering a wave off a time-dependent potential. The technique 
to solve this problem is self-evident: we will decompose the wave into a linear 
combination of plane waves 

{ H 3 k 

where x = r/a are comoving coordinates, and k is comoving momentum (the 
physical momenta k p h are related to comoving momenta by k = ak p h). We treat 
each fc-mode separately, since there is no coupling between modes to linear order. 
In momentum space, the equation of motion for Uk{if) is 

< + k 2 u k - M$u k = . (2.76) 

This equation can be solved in two asymptotic situations: the long wave- 
length (or low frequency, LF) limit, k 2 <C Mq and the short wavelength (high 
frequency, HF) limit k 2 3> Mq . The inverse of the "mass" Mq -1 is in fact a length 
scale proportional to the radius of curvature of the Universe. For instance, when 
the dynamics of the background is dominated by a scalar field potential and the 
Universe is inflating then Mq -1 w (ai?) _1 , that is, this "mass" scale is only the 
horizon scaleQ. In this case, we speak of modes bigger (LF) and smaller (HF) 
than the horizon. 

The HF limit is simplest, since we can neglect the time-dependent mass Mq. 
Physically, it means that perturbations of very small wavelength do not feel 
the curvature of spacetime (although they still feel the expansion, since their 



physical wavelength is being stretched), and the solutions of (2.76) are 



ui HF)± (r,) = Ai HF ^e ±ik » . (2.77) 



10 When the scale factor is increasing exponentially the horizon size in physical coordinates, 
Rh = H , is approximately fixed. In comoving coordinates, though, a fixed physical scale 
is exponentially smaller, = Rh/ 11 ~ Rn e Ht - 
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If the expansion of the Universe is stretching the wavelengths faster than its 
curvature radius is increasing, some modes will eventually become larger than 
the horizon. During an inflationary period this process becomes explosive, with 
modes crossing the horizon at an exponential rate (remember that the horizon 
scale in comoving coordinates is Rh/cl)- 



For LF cosmological perturbations with k 2 <C Mq , the solution to Eq. ( 2.76 ) 
is, after some algebra, 



,(LF) 



l 



(2.78) 



(p' [a 

Notice that there is an additional integration constant implicit in the integral in 
( [2.78] ), so that both solutions u^ F ^ and uj^ F ^ need to be complemented by 2 
"boundary conditions" - typically the metric perturbation and its time derivative 
on a given time slice. The general solution to second-order wave equations 
always involve two linearly independent solutions. There is one dominant and 
one decaying solution for each mode, and we usually ignore the decaying one, 
except when making the transition between two phases with different expansion 
regimes or when connecting a HF mode that crosses the horizon to the LF 
solution outside the horizon. 

Now it is convenient to write the metric perturbations in terms of cosmic 
time t (for time derivatives, use that drj = dt/a). The sub-horizon modes are 



i HF) 



a + (HF) +ik (dt/a 
A k e 



A- k {HF) e 



ik J dt I a 



(2.79) 



where the integration constants in the exponents can be absorbed in the defini- 
tions of A^ F '^ so that there are really only 2 constants. Super- horizon modes 
behave like 



i LF) 



A (LF)D , 1 



H 



adt 



(2.80) 



where the index D will become clear in a minute. 

To make the discussion more concrete, let us consider the case when the 
background expands in a power-law, a ~ t p with p > and p ^ 1 (we exclude 
the case p = 1 for simplicity). There are two different regimes that are possible 
with this type of scale factor, for which the deceleration parameter q = |§ is 
bigger or less than zero: the case p < 1 gives q < and is decelerating, while 
the case p > 1 is accelerating, so q > 0. 



We can easily perform the integral in (J2.79) , and the solution reduces to 



At 



l—p k 



(2.81) 



where to < t and kg are constants. In the case p < 1 the exponent increases with 
time, hence the perturbations oscillate with a growing frequency and there is no 
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dominant solution. When the expansion is accelerating, p > 1, the frequency 
decays and the cosine mode becomes dominant, since the sine mode becomes 
small as the argument in the exponential goes to zero. 
LF modes in these backgrounds have the form 

In both cases, p > 1 and p < 1, the constant term is dominant, while the second 
term fastly decays. This result is very important, and holds even when the scale 
factor grows exponentially. We say that perturbations on scales bigger than the 
horizon are "frozen", i.e., the dominant mode in their amplitude is constant in 
time. 

The detailed description of spacetime in terms of perturbations is encrypted 
in the "spectrum" {Afc}, and this description remains valid as long as the in- 
teractions are approximately linear. For example, if we could deduce the exact 
form of the spectrum at an early phase in the evolution of the Universe, when 
nonlinear structures were still irrelevant, we would be able to determine all the 
structure formed thereafter, including the times when nonlinearities of a given 
scale became important. For the purposes of this work, knowledge of the spec- 
trum is crucial: it gives the information about how much amplitude there is 
in all modes, and therefore it also carries the key to quantities like the energy 
density and pressure carried by each of the modes. 

Of course, the precise form of the spectrum for all k cannot be deduced 
from first principles without invoking transcendental powers, but we will see in 
the next chapter that some physical processes in the early universe are effective 
enough to create spectra with very distinct qualitative features. We turn now 
to just such a mechanism: the inflationary Universe. 
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Chapter 3 



Inflationary Cosmology and 
Early Universe Physics 

There are several reasons to support the heterodox notion that the universe 
suffered some sort of accelerated expansion fueled by exotic matter during a 
tiny fraction of a second after the Big Bang. The puzzles left unsolved by Big 
Bang cosmology are, in order of importance^]: 

• the cosmological constant problem; 

• the horizon or causality problem; 

• the problem of the origin of structure; 

• the flatness (or entropy) problem; 

• the problem of primordial magnetic monopoles and domain walls.. 

As often happens historically the solution to the least important problem leads 
to an idea that elucidates other, more interesting questions. This solution can 
still leave out the most interesting problems: the cosmological constant remains 
a mystery in cosmology, although Inflation can solve all the other questions 
mentioned above. 

As mentioned in the last chapter, inflation's account of the rich structure of 
the universe is successful enough that this feature alone should be sufficient to 
take the scenario seriously (although it is not the only satisfactory mechanism 
on the market.) We discuss briefly the other compelling reasons that make this 
theory so attractive. 



1 Personal taste might interfere with the order 
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3.1 Puzzles of pre-inflationary cosmology 



• Horizon Problem 

A horizon at time th is, generically, a 2-dimcnsional surface denned with respect 
to a given reference point Ph that separates the universe into 2 regions, a "white" 
region containing Ph and a "black" region. If a horizon at point Ph exists at 
time th, an observer at that point cannot receive information (light signals, to be 
specific) from the black region. If the information was traveling from a distant 
point P p h in the remote past and could not reach the point Ph up to the time 
t p h, we say that there is a particle horizon. If the information started to travel 
from a point P e h at the time t e h but would never reach the point Ph, even after 
an infinite amount of time, we have an event horizon. The black regions in either 
cases are defined as the union of all points P p h or P eh , or, as the "geometrical 
locus" of points satisfying the respective causality conditions above. 

A typical example is the Schwarzschild black hole spacetime, which possess 
a horizon with respect to any point outside the Schwarzschild radius, since the 
white external region is causally disconnected from the the black interior of 
the hole. This case is unique, though, in that a Schwarzschild black hole has 
always been there. Hence both particle and event horizons can be defined, and 
they coincide in the "black hole horizon" . The case of a collapsing star is more 
realistic, though, and in that case only an event horizon is formed. 

In Cosmology, FRW spacetimes usually have either one of the two horizons. 
Consider the problem of propagation of light in an expanding universe, 

f*t dt 

ds 2 = dt 2 - a(t) 2 dx 2 =0 =► Ax = — , (3.1) 

Jti a 

with the scale factor 

a(t) = t (J-^J , (3.2) 

where to is the present time (or age of the universe) and p > (remember that, 
for pressureless matter p = 2/3, and for radiation p = 1/2.) 

The comoving distance traveled by light signals between the time interval 
(U,tf) is then given by 



1 

Ax = 



l-p 



tjY-" _(U 
to J \t 



(3.3) 



Let us answer first in which circumstances there is a particle horizon, and why 
they are important. 

If there is a finite maximal distance Ax p h that a photon could have traveled 
if it was emitted at time U — and received at any time tf, then there is a 
particle horizon. In this case, photons traveling from a spherical shell of radius 
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Axph towards the center of the sphere will just make it to the center, with 
photons coming from outer shells not being able to get to the center at the time 
t p h- From the expression to Air we see that U — gives a finite radius only 
when p < 1. For p > 1 the expression diverges and there is no particle horizon. 
Setting tf = tph we have then 



Particle horizons are important in Cosmology because, if one existed to- 
day, CMB photons received from opposite directions would provide information 
about two causally disconnected regions. To see this, compare the particle hori- 
zon at the time of last scattering of the CMB (the time of decoupling, t^) with 
the distance traveled by light from the surface to us: 



where we have taken p = 2/3 since the universe was very nearly matter domi- 
nated since decoupling^. We see that the particle horizon at decoupling was a 
factor 1/100 smaller than the particle horizon today. In the CMB sky map, this 
fraction translates into a square of a few degrees (the present particle horizon 
is the whole 360° sky). A few billion years from now this patch would be even 
smaller, reflecting the fact the particle horizon in the future has increased while 
the particle horizon at decoupling remains fixed. That is, as we go into the 
future, we gain access to more distant regions of the universe, regions that did 
not "know" about each other at tdec- 

A measurement of CMB radiation at the South pole and at the North pole 
would be, therefore, a comparison between physical properties (such as temper- 
ature) of two regions that had never come into causal contact until the moment 
those photons were observed. The fact that the CMB map shows a uniformity 
of 1 part in 10 5 despite the existence of particle horizons poses a problem to 
the usual radiation- and matter-dominated cosmological models. In order to ex- 
plain this uniformity in the framework of these models we have to invoke some 
acausal mechanism, like preparing the initial state of the whole universe (or a 
good part of it) right at the beginning of the Big Bang. We will see later how 
inflation can solve this problem, at least for the patch of the universe we happen 
to live in. 

Event horizons, on the other hand, are also important, although they do not 
lead immediately to problems with classical cosmology. If there is a maximal 
distance Ax e beyond which photons emitted on a spherical shell at a time t e h 
can ever reach the center of the sphere, then there is an event horizon. Photons 

2 The presence of a small Cosmological Constant would not change that too much. We know 
that, today, it could account for at most some 80% of the energy density in the universe, and 
as we go back in time this contribution becomes negligible. 




p < 1 



(3.4) 




(3.5) 
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emitted from the outer shells (with radii bigger than Ax e ) at time t e h will 
never reach the center, because the expansion overcomes the propagation of the 
photon. In other words, the photon is rcdshiftcd to zero energy before it reaches 
the center of the sphere. 



In formula (3.3) we make the emission time U = t e h and the observation 
time tt — * oo, and it easy to see that for p < 1 the expression diverges (the 
expansion is not fast enough that it can redshift photons to zero.) For p > 1 
the expansion is fast enough that event horizons are well defined, 

Ax eh (t eh ) = — !— ( , p>l . (3.6) 

In the special case when a = aoe Hot , Hq constant, the event horizon can be easily 
calculated: it is, in comoving coordinates, e Hg ° , and in physical coordinates 
simply H^ 1 . 

• The Structure Formation problem 

If the universe was perfectly homogeneous, no stars or galaxies would have ever 
formed. We clearly need to introduce inhomogeneities that can grow and become 
large-scale structure, but in a purely classical theory these "seeds" for structure 
can only appear as ad hoc initial conditions. What explains the scale-invariant 
pattern of the initial inhomogeneities and their tiny initial amplitude? How did 
they arise? 

• The Flatness problem 

There is a critical energy density p c above which the total mass of the uni- 
verse is big enough to stop the present expansion and turn it into a contraction 
("crunch"). If the density is smaller than this p c , the universe will expand for- 
ever, becoming colder and emptier. If the universe has the density parameter 
f2 = p/p c > 1 it is said to be closed; if CI < 1 it is said to be open; if O equals 
precisely one, the universe is flat and it will continue expanding forever, with 
an expansion rate that vanishes asymptotically. 

The flat case ft = 1 is clearly unstable, just like the Einstein - de Sitter uni- 
verse: any deviation from flatness will increase rapidly with time. In particular, 
the lifetime of a Planck-scale universe with energy density p p i ~ is of the 
order of a Planck time tlj ~ 10~ 44 s. By lifetime here we mean the time a closed 
universe would take to crunch or the time interval that it takes for an open one 
to become effectively empty and void. Only if |fl — 1| ~ a typical planckian 
universe can survive a longer period of time without collapsing or turning into 
vacuum. 

The universe is presently very nearly flat, 0.1 < Oo < 2 , which implies that 
in the past (t <C to) it was much flatter than it is now, |1 — Cl(t)\ <C |1 — f2(to)|. 
In fact, for the density parameter to be so close to 1 today, one Planck time 
after the Big Bang the universe would have to be flat to within one part in 
10 59 , that is, |fJ(£ p i) — 1| < 10 -59 ! The question of why our universe was so 
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astonishingly flat soon after the Big Bang, or cquivalently, why it survived for 
so long, is known as the flatness problem. 
• The Primordial Monopoles problem 

As the temperature dropped in the early universe, symmetries that existed at 
high temperatures are broken. For example, the electroweak phase transition 
occurred at an energy scale around 100 GeV and gave mass to the originally 
massless vector bosons of the Glashow-Salam- Weinberg theory. 

During a symmetry-breaking phase transition, many strange objects such 
as domain walls, cosmic strings and monopoles can be formed. Most of these 
objects are stable, being topological defects, and in the case of monopoles their 
annihilation rate is too small and their creation rate too big to ignore them. 
Sometimes the production of monopoles during a phase transition is so copious 
that they may overpopulate the universe. That is the problem with many grand 
unified theories, which have t'Hooft-Polyakov magnetic monopoles as a byprod- 
uct of symmetry-breaking phase transitions. The monopole creation rates are 
too high, and their mass (10 16 times that of the proton) too big: if they existed 
at all, the universe would have collapsed long ago. The magnetic monopoles 
problem is how to reconcile grand unified theories with the fact that the uni- 
verse lived for so long and is so patently empty of monopoles. The same reason- 
ing can be employed to show that a particle theory with a phase transition that 
breaks a Zi symmetry produces domain walls (regions interpolating |+) and |— ) 
vacua). There is no reason why electroweak domain walls, for example, don't 
exist. However there is no evidence of that, on the contrary, their existence 
would have catastrophic cosmological consequences and is a strong indication 
that there is no discrete symmetry breaking in the Higgs potential. 

3.2 The Inflationary Universe 

It was the last problem, that of topological defects, which precipitated the 
discovery [^oj that a brief period of exponential expansion can explain not only 
the absence of monopoles and domain walls, but also the regularity of the CMB 
temperature map over the whole sky and the reason why the density parameter 
was so close to the critical value 1. 

The rationale is simple: an exponential expansion would reduce any initial 
density of monopoles and domain walls (as it would of any preexisting particles) 
exponentially. This is also known as the "no-hair" theorem of de Sitter space, 
analogous to the one for black holes: in de Sitter space, all particles and inho- 
mogeneities within the event horizon H~ l will have left that sphere (or crossed 
the event horizon) by a time of order if -1 . Events happening inside the horizon 
will not be affected by events outside the horizon. 

After this epoch of inflation, the universe must be again filled with radiation 
and other sorts of matter (but not monopoles or walls), hence the only visible 
effect of the inflationary epoch would be to do away with the monopoles. This 



31 



exit from inflation, called reheating, is crucial to provide a fresh start to a 
hot universe devoid of unwarranted creatures, but with a healthy population of 
friendly animals like photons, baryons, leptons etc. 

This exponential growth of the scale factor will also eliminate the particle 
horizon problem. A period of inflation in the early universe means that between 
a time t\ and a time ti the scale factor grows exponentially, a cx e Hot . After 
this period the universe is reheated, or refilled with radiation, and its evolution 
is once again given by a oc t 1 / 2 . After a bit of simple algebra it is easy to obtain 
from ([O]) that the article horizon gets an exponential contribution. If we set 
ti = ti + NHq 1 we have that, for any time t > t%, the particle horizon is 

Ax ph (t)=2^-j +— . (3.7) 

Clearly the exponential is going to dominate for N large enough, therefore 
this distance can be made much larger than the radius of the present observable 
universe, if inflation lasts for a sufficient number of "e-folds" N. For the particle 
horizon at the time of decoupling to be bigger than the radius of the observed 
universe at the present time, N ~ 50. If N > 50 the horizon problem is 
solved: the CMB is so nearly isotropic because the distant regions of the sky 
we observe now were in fact microscopically close at times t < U in the early 
universe. Their present cosmological distance is due to an inflationary phase 
that took place after U and exponentially stretched all physical distances for a 
time > 50H^\ 

It is clear then that an inflationary phase in the early universe, with a suf- 
ficient number of e-foldings N > 50, is highly desirable. Our task now is to 
construct a realistic model with these features. Below we describe the simplest 
and most successful of those models, the so-called Chaotic Inflation scenario 
H [nj. We do not discuss Old, New, Double or String inflation in this mono- 
graph (for reviews, see j2l], |23|.) 

Consider a scalar field tp with the potential 

y M . (3.8, 
n \ Mpi J n 

where n > 0, < m <C M p i and we have also introduced Planck units, M p i = 1. 
The energy density p — Too of this field is given by (pT5|), 

A*,*) = ^ + ^+n«o , (3.9) 

and its pressure p = —T?/3 is 

A*,*) = £-^-n*o • (3.io) 
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It is not necessary to assume that the scalar field is the only component in 
the very early universe: spin 1/2 and 1 fields could also be present but their 
influence will be vanishingly small, as we show below. 

One Planck time after the Big Bang, the typical energy density of a Planck- 
size region of the universe is, by virtue of the Heinsenberg uncertainty principle, 
of order l[MrJ. Focus on the 3 different components of the energy density and 
pressure, the kinetic terms <p 2 , the gradient terms (Vp) 2 and the potential term 
V(p). We assume that each one of those terms and, in addition, the curvature 
R 2 ~ H 4 should assume values in the vicinity of l[MpJ. These conditions are 
then, 



p 2 ~ 0(1) , (3.11) 
(V^) 2 ~ 0(1) , (3.12) 



—V n ~ 0(1) , (3.13) 



n 



H ~ 0(1) , (3.14) 



The orders of magnitude above are all consistent with the Klein-Gordon equa- 
tions for the scalar field and with the Einstein equations that relate the curvature 
to the energy and pressure density of p. It is easy to see that the value of the 
scalar field p in a Planck-size region H -1 ~ 1 is given most simply by the second 
to last equation, 

P P i ~ m~ 2/ ™ > 1 • (3.15) 
The remaining conditions, ( [3.11 ) and (3.12) impose constraints on the time 



derivatives and gradients of the scalar field. These are actually upper bounds, 
since if any one of the terms is bigger than the Planck scale, it will dominate 
over the others but then the universe does not have a classical description, 
since quantum fluctuations of the metric would destroy the classical notions 
of spacetime. Let us focus on a region of the universe that allows a classical 
description, and in which the scalar field is relatively homogeneous. To be 
precise, consider a region bigger than one Hubble horizon H , in which the 
scalar field have an approximately homogeneous value over a fixed time slice, 
<C |-ff| -2 - We assume, in addition, that the scalar field varies slowly in 

time in that region, ^ -C \H\, although this condition can be relaxed, since 
the friction term 3H p in the Klein-Gordon equation supresses any initial time 
derivatives of the homogeneous field. 

With those assumptions, the energy and pressure of the scalar field are 
approximately given by 



V(p) 
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(3.16) 



P s ~ -V(<p) 



(3.17) 



which, by the continuity equations p + 3H(p + p) — means that /9 s ~ 0. The 
homogeneous scalar field, thus, contribute an approximately constant energy 
density, while all other particles (such as photons, electrons and monopoles) 
contribute an energy density which decays in time like radiation, p r cx a~ 4 (t). 
It is easy to conclude that the scalar field, just like a cosmological constant, 
quickly dominates the evolution of the universe over the other matter fields, 
which are exponentially dissolved or "redshifted" . The scalar in question is not 
necessarily a fundamental scalar, it could be for example a 0-spin condensate 
of a fermionic or bosonic field. That the future evolution of this patch of the 
universe is independent of the non-scalar spectator particles is a consequence of 
the "no-hair" theorem for de Sitter. 

Summarizing, if there is a domain ("bubble") in the very early universe of 
sufficiently large size (bigger than i? _1 ) in which an interacting scalar field as- 
sumes an approximately homogeneous value, this patch will become dominated 
by the scalar field in a time lapse of the order H . Since the main contribu- 
tions of the scalar field to the energy and pressure come from the scalar potential 
V(ip), the universe inflates - the scale factor increases exponentially. We call 
the inflaton the scalar field responsible for cosmological inflation. 

Clearly, regions outside of the inflating bubbles become irrelevant, since the 
inflating domains occupy an exponentially larger share of the global volume of 
the early universe. According to the chaotic scenario, we live in one such bubble 
that inflated, reheated and cooled off to eventually become the dust-dominated 
universe that is visible to us today. As we shall see later, though, not all bubbles 
begin or end the inflationary cycle at the same time. In fact, new inflating 
bubbles are continuously generated in the chaotic scenario, a phenomenon due 
to long wavelength quantum fluctuations of the inflaton field that occur when 
the scalar field is above the so-called self-reproduction scale ip sr — m _2 /(™ +2 ) < 
(p < f pi. A bubble in which the scalar field has fallen to a value smaller than 
ip sr is not affected by macroscopic quantum fluctuations of the scalar field, and 
is said to "evolve classically" (see Section 3.3 for a detailed discussion.) 

Let us consider for now the fate of one such classically evolving bubble in the 
case where the bubble have no boundaries, that is, as if it occupied the whole 
universe (we know this approximation is valid because of the no-hair theorem 
for de Sitter.) For simplicity, assume that at time U there is a bubble of radius 
H^ 1 in which a homogeneous scalar field^j takes the value 1 <§; <fi < <p sr - 

The Klein-Gordon equation dictating the evolution of the scalar field is, 
without approximations and in a FRW homogeneous background, 

3 This assumption is natural since a quasi-homogeneous bubble expands to a radius oc 3H ~ 1 
in a time H~ 1 while during this interval the inhomogeneities grow at a much slower rate. 
Therefore, if we consider as initial time tj + H ~ 1 , we can chose another bubble of radius H^ 1 
inside the original bubble that is homogeneous enough for our purposes. 
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n<f + V'(<p)=<p + 3H<p-\v 2 <p + V'((p) = , (3.18) 

or 

where V = dV(tp)/d(p is the derivative of the potential (|3.S|). Immediately after 
ti this equation can be approximated by 

3H<p + V'c~0 => <P^~ ■ (3.19) 

The scale factor, on the other hand, is determined by one of Eintein's equations 
in curved 3-space sections, 

3H 2 + 3-^=p s , (3.20) 

where k = ±1,0 corresponds to a closed, open or flat universe. The term k/a 2 
should be small at ij, since we assumed that curvature, likewise radiation and 
other matter components, is not predominant at this early post-planckian time. 
As the rapid expansion begins (exponential or power-law with t q and q 3> 1), 
the curvature term becomes negligible as compared to H 2 . In the exponential 
case, H 2 remains constant while k/a 2 oc e~ 2Ht , and in the power-law case H 2 
falls as t~ 2 while the curvature term falls as t~ 2q . 

At this point a major success of inflation is already clear: if the curvature 
term in the Einstein's equation is not big enough to rapidly crunch or empty 
the universe, inflation will make it vanishingly small as compared to the energy 
density of the scalar field. Therefore, open and closed universes that go through 
a period of inflation asymptotically resemble a flat universe. After inflation is 
over and the scale factor evolves as t q with q < 1 the curvature term decays less 
fast than the energy density of matter, and the instability that we discussed 
in the second chapter starts to magnify the curvature again. This scenario 
can then explain why we don't observe a large curvature today: it was made 
exponentially small in the inflationary epoch. 

We neglect the curvature term from now on, since the discussion above shows 
that it is irrelevant during inflation. 

In the simplest but typical case which we consider here, n = 2 in the scalar 
potential, the maximal value for <pi is ip sr — mT x l 2 , and the value above which 
the classical description brakes down is ip p i = mT 1 (see Fig. 3.1.) The scalar 
field "rolls" down the potential after the scalar field drops below ip ar , until it 
reaches (p ~ 1 and the in flation ary e poch ends. 



The solution to Eqs. ( 3.19 ) and ( p.20|) for n — 2 and k = are straightfor- 
ward: 



H -^P ( 3 - 21 ) 



and 
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Figure 3.1: Potential energy V(ip) = m 2 ip 2 /2. Above the energy scale l[Mpj], quan- 
tum gravitational effects are important and the classical interpretation of spacetime 
brakes down. Above the scale ip ~ m -1 ^ 2 , quantum fluctuations of the scalar field 
are the dominant driving force and can force the field to go both up and down the 
potential hill. Below this scale, classical evolution is unaffected by quantum fluctua- 
tions and the scalar field slowly rolls down the potential. After the field drops below 
ip ~ 1[M P ;], inflation ends and the reheating period starts, where oscillations of the 
scalar field lead to particle creation. 



<p(t)-<Pi^-m^-(t-ti) , (3.22) 

where ipi is the value of the scalar field at the time ti . The Hubble parameter 
as a function of time is then 

TY1 2 

H(t)^H i -—(t-t i ) , (3.23) 
with Hi — ^7=<Pi- The scale factor can be calculated from this expression, 

a(t)*a i e H *<- t -V- B £W . (3.24) 

It is sometimes useful to use the homogeneous scalar field as a time parameter, 
and in that case we have the simple expression 

a{t) ~ a t e^-* 2) . (3.25) 

Notice that, since m<l, the time derivative of the Hubble parameter, H = 
— m 2 /3, is much smaller than Hi < m 1 / 2 /\/6, that is, it is valid to approximate 
the Hubble parameter as a constant during inflation. It can also be verified that 
the kinetic term <p 2 ~ 2m 2 /3 is much smaller than the potential term m 2 ip 2 /2, 
so that it can be neglected in the expressions for the energy density and pressure. 
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In conclusion, the solutions found are consistent with the approximations, and 
inflation only makes the solutions more accurate. 

During inflation, quantum fluctuations of the scalar field are coupled to the 
expansion of the universe, which amplifies these fluctuations. After their wave- 
length crosses the horizon size H~ l , the fluctuations freeze (their amplitude 
becomes constant) and quantum corrections cease to be important. These cos- 
mological perturbations from inflation reenter the horizon once inflation is over, 
and they provide the basis for the inflationary model for structure formation. 

We will discuss the generation and evolution of perturbations at greater 
length in the next section, but for now it suffices to say that the scale at which 
the fluctuations are generated is approximately constant during inflation. Fluc- 
tuations of the size of the Hubble radius have approximately the same amplitude 
during inflation. After a perturbation becomes larger than the horizon, its am- 
plitude freezes, although its wavelength continues to be redshifted as X p h oc a(t). 

When inflation is over, the Hubble radius increases much faster than a(t), 
and scales outside the horizon start to cross back into the horizon. Provided 
the reheating period (see below) is much smaller than the inflationary epoch, 
the perturbations generated during inflation with same amplitude reenter the 
Hubble radius H^ 1 after reheating. Since their amplitude has been frozen from 
the moment they crossed out of the horizon, when they cross back into the 
horizon their amplitudes are obviously approximately equal. This scale-invariant 
spectrum of perturbations obtained after the end of inflation is verified by the 
analysis of the CMB, and constitutes another major success of the inflationary 
models (but not one exclusive of these models.) 



When do inflation end? From Eqs. (3.9) and (3.1C) we see that the kinetic 



term ip /2 ~ 2m /3 becomes comparable to m <p 2 /2 when p — p re h — 1. At 



this point the second derivative in the Klein-Gordon equation ( 3.18| ), ip, cannot 
be neglected anymore. In fact, when the scalar field reaches p re h it accelerates 
down the potential, since the friction term 3H<p has become much smaller. 
Another way of seeing this is noting that the natural time scale of the inflaton, 
m~ , becomes equal to the cosmological time scale H~ l when p ~ 1, and after 
the field falls even further, the friction term becomes much smaller, ending the 
"slow-roll" (see Fig. 3.1.) 

In the new regime the scalar field oscillates fastly between two maxima 
of the potential with very little friction. The Klein-Gordon equation can be 
approximated by 

ip + m 2 <p~0 , (3.26) 
whose solution is straightforward, 

p(t) ~ p., sin (mt + 9) . (3.27) 

Notice that energy density and pressure oscillate together with the scalar field, 
but if we average over a cycle to -1 the result is 
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(P)m-i = ^ , (p) TO -i=0 . (3.28) 

This is the equation of state of pressureless dust, p — 0, and we know that 
in that case the continuity equations p + ZH(p + p) = imply that the energy 
density falls like a~ 3 . This time dependence is hidden in ip„ , which in fact decays 
slowly, while the scale factor falls as a power-law so that the approximation is 
consistent. 

This period of fast oscillations (with period m _1 3> H~ r ) marks the end of 
cosmological inflation, since the scale factor now grows as a power-law, a oc i 2 / 3 . 
At this point the couplings of the inflaton field to other matter fields becomes 
also important, and particle production occurs through the mechanism of para- 
metric resonance . This period is known as "reheating", and is the process 
by which the empty post-inflationary universe is replenished of the hot particles 
that we know were present at very early times. Of course, the temperature of 
the plasma during reheating should not be bigger than the GUT scale, otherwise 
the problem of monopoles would strike again. 

The inflaton field eventually decouples from the other matter sectors, and 
usually it is assumed that it settles at the minimum of its potential. At the 
end of reheating virtually all the energy contained in the scalar field at the 
end of inflation has been converted into ultra-relativistic particles (radiation). 
Sometimes the inflaton field leaves remnants, and it has been suggested that it 
could even provide some of the dark or "missing" matter predicted by primordial 
nucleosynthesis. 



3.3 Cosmological perturbations from inflation 

One of the main applications of inflationary models is the origin of large-scale 
structure. Inflation provides a causal mechanism whereby microscopic quantum 
fluctuations of the scalar field evolve into the classical cosmological perturba- 
tions that originated the structure of the universe. 

During inflation, the analysis at the end of Chapter 2 shows that the metric 
perturbations have a time dependence given by Eqs. ( |2~79| ) and ( pc|) , 



A +(HF) +ik f dt/a a-(HF) -ik f dt/a 
k e J "T ^i^. c J 



when the perturbation is smaller than the horizon, and 



(3.29) 
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(3.30) 



when it is bigger than the horizon. 

The factors are as yet undetermined, and the classical theory of pertur- 
bations does not provide any clue to what their value is. Of course, they are 
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fixed by the initial conditions. When there is no theory available to provide 
these initial conditions, we can only determine the Ak's through the observed 
inhomogencitics in the present-day universe, and use the theory of cosmological 
perturbations to deduce what the "seeds" might have looked like at very early 
times. Modern attempts at explaining large-scale structure such as inflation 
and topological defects, try to provide physical mechanisms that could generate 
a spectrum of A k that is consistent with the observed map of the CMBR from 
a minimal set of parameters and initial conditions. Chaotic Inflation is partic- 
ularly minimalistic, and necessitates only the self-couplings of the inflaton and 
its initial value on a homogeneous time slice. 

Clearly, the theory about the origins of the seeds are outside the scope of the 
classical theory of cosmological perturbations. In the inflationary scenario, the 
mechanism stems from the inherently quantum fluctuations of a scalar field in 
flat spacetime. In the case of topological defects, the origin of perturbations is a 
"classical" effect in the sense that the perturbations are created by the motions 
of macroscopic solitonic objects. 

On small wavelengths, fluctuations of the scalar field around a homogeneous 
background ipa can be described by the usual canonical quantum operators 
acting on a Fock space. When their wavelength crosses the Hubble radius, the 
fluctuations become squeezed in states with a very high occupancy number, and 
are thus best described by functions with a statistical distribution. Fluctuations 
bigger than the horizon during inflation are said to be classical perturbations, 
in the sense that they follow the classical equations of motion, 5G^ U = ST^ U , 
without quantum corrections. 

Fluctuations of the homogeneous scalar field inside the horizon can be treated 
by quantum field theory. Outside the horizon the expectation value of the 
field excitations is very high, and can be replaced by a sum over momentum 
modes of classical fields with amplitudes obeying a Gaussian statistical distri- 
bution. The spatial average (8f) of linear perturbations yield zero identically, 
just as happens with the expectation value of a linear quantum fluctuation 
((0\5f + a + + 5f~a~\0) = at all times). The notion of spatial average here is 
defined on the biggest possible scale, that is, on the scale of the physical radius 
of the entire inflating bubble, H~ x (ij)a(t) / a(ti) ^> H~ x (£), where U is the time 
when inflation started in that bubble. Of course, fluctuations around the homo- 
geneous value are by definition zero on this largest scale, although they might 
be nonzero on smaller domains inside the initial bubble. 

The two-point functions of fluctuations, as opposed to linear functions of 
the perturbations, do not vanish in general, (0\SfkSfk'\0) oc Sk'k- Knowledge of 
the correlation functions of field operators at the horizon scale (|fc| ~ H) gives 
the amplitudes of modes |<5//t| 2 with a classical interpretation and Gaussian 
statistics. 

The reasoning used below is as follows: we first determine the amplitude of 
the modes of the perturbations when they cross outside the horizon. That fixes 
the amplitudes of all modes created during inflation that are of cosmological 
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interest. We then just follow the classical evolution of the modes to obtain the 
spectrum of the perturbations, both during inflation when their wavelength is 
exponentially bigger than the horizon H , and after inflation ends when they 
start to reenter the horizon. 

We start by presenting a non-rigorous derivation ]5| of the "spectrum" 4>k, 
based on the Hawking temperature associated with a de Sitter spacetime |p5| . 
Quantum fluctuations of the scalar field on scales of order the Hubble radius 
H^ 1 should have an amplitude proportional to the Hawking temperature of the 
thermal fluctuations of de Sitter. Their root mean square value on a sphere 
with a radius of the size of the Hubble horizon is then 

\6<p H \~T H = ?L . (3.31) 

Since <p(t) ^> 1[M P ;] and H ~ mtpo{t) during inflation, these perturbations are 
small, as expected: 5(p/(po <C 1. The energy density of the universe, during 
inflation, is dominated by the potential energy V (<£>), and thus fluctuations with 
scales close to the Hubble horizon should also be proportional to the fluctuations 
of the scalar field, Spn/p ~ SV/V. Using the model with V = m 2 <p 2 /2 and Eq. 

( p. 31 ), as well as the identity[[t3| — = —2<p\ k ^ valid when the perturbation 

9 k(t) 

crosses the horizon, k{t) = H, we get that, during inflation, the amplitude of 
the modes for the scalar perturbation <f>jj is (up to some numerical coefficients), 



~m . (3.32) 

This amplitude corresponds to a perturbation on the scale of the Hubble radius, 
and real perturbations are expected to be positive or negative, that is, the 
fluctuations either raise or lower the energy density inside the horizon. 

We are primarily interested in the spectrum of the momentum modes, so we 
need to perform a Fourier transformation starting from the fluctuations 4>h(t). 
This quantity reflects net perturbations inside a sphere of radius r, and its value 
can assume both negative and positive values. The best we can do then is to 
evaluate the square of this quantity on spheres of radius r. The integration over 
volume inside the sphere introduces a factor of r 3 ~ fc~ 3 (r), so we find the result 

$(r) ^ l&Hr - |^ W l-fe- 3/2 WI^H| r • (3.33) 

l^lfc(r) ^ S usually referred to as the power spectrum of the field tf> at the scale 
k(r). We finally find that, for quantum fluctuations with size comparable to the 
Hubble radius, k ~ H, 

|0| fe ~fc- 3 / 2 m . (3.34) 

Due to the special time-translation invariance properties of de Sitter space- 
time, we can generalize this result to other scales. As soon as the physical 
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wavelength of a fluctuation becomes bigger than the horizon, the time depen- 
dence of its amplitude <j>k is given by the classical equations of motion and, as 
a consequence, its amplitude freezes [actually it evolves very slowly with time, 
0fc/0fc <C H, see Eq. ( 2.80| )]. What that means is that we can assume that 



the amplitudes (3.32) carries on for a long time after the perturbation leaves 
the horizon, and, as long as inflation goes on, this amplitude is approximately 
constant. Therefore, during inflation and for modes with wavelengths bigger 



than the horizon, the spectrum is given by (3.32), where the fc's correspond to 
scales bigger than the horizon, k > H. 

The argument above relied on a separation between the classical and the 
quantum regime that was criticized for being artificial. The full quantum treat- 
ment is slightly more complicated, but the end result, Eq. ( 3.34| ) during infla- 
tion, is essentially the same. 

We now proceed to the full quantum treatment of the creation of perturbations 
Our aim is to quantize, using the canonical scheme, the coupled scalar and met- 
ric perturbations, and calculate their two-point correlation functions (Sf 2 ). In 
order to use the variational principles (find the canonically conjugated momenta 
and the Hamiltonian) , the action must be determined to second order in the 
perturbations. We start from the fundamental action 

S = J ^^xf-^R + cA , (3.35) 

where R is the Ricci scalar curvature and C m is the Lagrange density function 
for the matter sector. We are interested in a scalar field, for which 



C ip = -D»<pD li <p-V(<p) . (3.36) 



1 

2 

We should now expand the action ( 3.35| ) to second order in the scalar per- 



turbations <p and Sip, around the flat FRW background with a(t) and <po(t). The 
O t/l -order term gives the equations for the backgrounds, while the l s *-order term 
is identically zero (its variation SS is proportional to the background equations 
of motion) . 



It turns out that, due to the constraint (2.67), there is only one dynamical 
variable expressing scalar perturbations, the gauge-invariant potential We 
shall find explicit solutions for the modes $k, then relate those to the modes 
Sifk and (jy^. Finally, we give explicit expressions for the spectra of the scalar 
perturbations. 

The action in terms of $ is most easily expressed in conformal time, and we 
simply state the result here (for a detailed discussion, see Ref. p3|): 

6 2 S=^ fd^x Id' 2 - (W) 2 + y 2 } , (3-37) 
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where we have discarded several total derivatives that do not affect the quanti- 
zation procedure or the classical evolution. The potential $ has dimensions of 
mass, and is defined as 



•d(r],x) = a5(p(r/, x) + z<P(t]x) , (3.38) 



where H = a' /a. The action ( [3.37 ) describes a scalar field $ with a time 



dependent mass M 2 = —z"/z, in analogy with the equation of motion (2.72) 
for the perturbation u defined in Section 2.6 . This mass is, as before, a function 
of the background quantities ifioiv) an d T~t- When the scalar field is not rolling 
down the potential, </V = = z, the perturbations are not amplified and as 
a consequence there is no creation of particles. The quantization procedure 
used here is also analogous to the quantization of scalar fields in Minkowski 
space-time in the presence of external fields. 

The second step in canonical quantization is to determine the canonical 
momenta associated with and impose the commutation relations between 
the field operators and the canonically conjugated momenta. The momenta are 



defined with the help of (3.37) as 



*•(»/> *) = -00T=0' • (3-39) 

In the quantum theory, fields and momenta $ and it are raised to the cate- 
gory of Heisenberg operators $ and w, and we impose the canonical same-time 
commutation relations 

[S(r),£)J(r),3?)] = [n(r],£),7t(v,^)}=0 (3.40) 

and 

[&(ri,x),n(ri,tf)]=i6(£-&) . (3.41) 

The dynamics of the field operators is given by the hamiltonian equations 
of motion 

id' = [3, H] , in' = [tt, H] , (3.42) 

where the Hamiltonian can be easily reconstructed from the second-order action 
5 2 S, 

H = Jd 3 x(tf'Tr-£) = ^jd 3 x^Tr 2 + (W) 2 -^-d 2 ^ . (3.43) 
The equation of motion for i? can be obtained from the hamiltonian system 



(3.42), or equivalently by varying the action 8 S with respect to the field 
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0" - V 2 d - ^-S = . (3.44) 

The best way to solve this equation is, clearly, to expand the field in plane 
waves (generalized spherical harmonics in the case of open and closed spaces): 

Hrj, x) = ^J j-^ [r k e^a- + 4 k e-**4\ , (3.45) 

where the operators a will be interpreted as raising and lowering (creation and 
annihilation) operators acting on the vacuum |0) of the Fock space representa- 
tion (we use the Hcisenberg representation). All time dependence is contained 
in the functions #fc(?7), which now satisfy the equations 

tffc" + (k 2 - z"/z)ti k = . (3.46) 

In terms of the creation and annihilation operators, the commutation rela- 
tions are the ones for a simple harmonic oscillator, 

[a k ,a k ,} = [a+,a+}=0 , [a k ,a+]=S(k-k') . (3.47) 

The mode functions "&k(il) must obey the following normalization conditions so 
that the last commutation relation is equivalent to {&(x),tt(x')] = iS(x — x')\ 

W k -r k 'd k = 2i . (3.48) 

Usually, modes associated with the annihilation operator a k have a positive 
frequency w 2 (k) = k 2 + M 2 , while the modes corresponding to a) have neg- 
ative frequency. In our case the mass term M 2 = —z"/z is time-dependent, 
leading to a mixing between the positive- and negative-frequency modes asso- 
ciated with a k and ojj". Hence, even if we define a vacuum state for the Fock 
representation at some initial conformal time |0, 771) , this vacuum will appear 
to have a non-vanishing particle content at a later time, a^(^2 > f?i)|0, rji) 7^ 0. 
Mathematically, it is possible to define eigenfunctions ipkivi) °f the creation 
and annihilation operators at time rji, but these cannot be expressed in terms 
of eigenfunctions at a later time 772, V'fc^)- The number operator at time 
rji, Nj. — a t ( 7 li) a k (Vi) ^ vanishes for the vacuum |0, 771) , but is not zero if the 
operator is evaluated at a later time, 



(N 2 ) m =(0, m \N k ( m )\0, m )^0 . (3.49) 

Conversely, it is equivalent to say that the creation and annihilation opera- 
tors at different times mix with each other. With that perspective, it makes no 
sense to define a unique set of eigenfunctions that arc annihilated by all oper- 
ators a k {t). The linear transformations that relates creation and annihilation 
operators at different times are called Bogoliubov transformations: 
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( a k {m) \ = ( a k{rinm) Ptivum) \ ( a k {m) \ (3 50) 

where unitarity implies that |o:/c| 2 — |/3fe| 2 = 1. 

If wc define the number operator at the time 772 in terms of the vacuum at 
772, it is clear that, at an earlier time 771, the occupation number is given by 



(Nl) m = (Q im \4(m)^(m)%m) (3.51) 



(0,?7i|a fe (rJi)/? fc (??i,772)/9fe(r7i, ry 2 )a+ (77i)|0,77i) = \(3 k {vi, ^2) | 2 



and conversely, if we define the vacuum at 772, the number operator defined in 
terms of the vacuum at rji has the expectation value 

<iV, 1 ) J , 2 = <0,7 7 2|a+(77 1 )a^(7 ?1 )|0,7 7 2) = |/3 fc (7 7 2 , 7 7l )| 2 . (3.52) 

The "forward" (771 — > 772) and "back ward " (771 *— 7/2) Bogoliubov coefficients can 
be related by inverting the matrix ( |3.50|) and taking into account the unitarity 
constraint, 

akim^m) = a *{m,m) . Pk(m,m) = -Pijii,m) > (3-53) 

therefore it is a consequence of unitarity that (Nl) V2 = (N%} m . 

To summarize, an observer at 772 will measure a nonvanishing number of 
particles N%(r)i) in the "vacuum" state |0, 771) as long as the Bogoliubov trans- 
formations are not trivial, i.e., as long as (3 k 7^ 0. This "particle production" 
from an initial vacuum state is a consequence of the time-dependent background 
external fields, and is the mechanism responsible for the origin of cosmological 
perturbations in inflationary models. 

The ambiguity in chosing a vacuum is related to the freedom in choosing 
the precise initial conditions for the mode functions $ k and i5l at some initial 
time. Since the equations of motion that determine the °d k s is second-order in 
time, there are two integration constants that must be fixed. The normalization 



constraints (3.48) reduce the number to one free parameter, which is essentially 
the choice of an initial time 770 such that the vacuum |0, 770) is empty at 770. 

During the inflating period, high frequency modes are redshifted by the 
expansion of the universe, and eventually become low frequency. We will define 
the vacuum states in the limit k — > 00, when the perturbations are well inside 
the horizon and we can consider the background to be approximated by flat 
Minkowski, therefore the ambiguity in the definition of a particle that occurs 
for long w avelen gth modes in curved spacetimes is abse nt ||13||. The equations 



of motion ( 3.46 ) and the normalization conditions ( 3.48 ) imply that the mode 
functions are 
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# k (Vi)-M(kr h )k-V 2 e +ikr » , ti' k ( Vi ) ~ iN(k m )k 1 / 2 e+^ (fcrjj » 1) , 

(3-54) 

where |M(fcry,)| — ► 1 and |iV(fc?7j)| — » 1 when fcjft 3> 1 and the normalization 



condition is 



MTV* + M*/V = 2 . (3.55) 



The time rji can be chosen to be the time when the inflationary epoch begins, 
so that the vacuum is defined with respect to this initial moment. If there are 
particles or inhomogeneities present at that moment such that vacuum would 
not be the appropriate definition, this "initial time" can be chosen a few e-folds 
later, when the expansion of the universe will have diluted the initial deformities. 

The choice of initial time is akin to the choice of initial conditions on a big 
enough patch of the universe. Since the observed universe originated inside the 
Hubble radius at TV ~ 50 e-folds before the end of inflation, we could chose this 
initial time as being any moment from the beginning of inflation up to the time 
?76o when there were 60 e-folds left to the end of inflation. Although the choice 
of initial vacuum in this time interval does not alter the results of calculations of 
the CMBR, it will play a significant role when we discuss the global features of 
the chaotic scenario, that is, features of the spacetime on scales of the particle 
horizon H~ 1 (ti)a(to)/a(ti) and larger (to is the time now and U, as before, is 
the time when inflation started.) Notice that if inflation occurred for more than 
60 e-folds, the particle horizon could be many orders of magnitude bigger than 
the presently observable universe. 

We now wish to go back to the scalar perturbations <j> and 6<p. First, re- 



member that the two are related by the constraint (2.67) 



H(j>=^ 8ip , (3.56) 



so that all we have to do is relate •& to cj>. From the definition of $, ( |3.38| ), and 



Eq. ( [2.68D , it is straightforward to obtain, after some juggling with algebra, 
that 

- -If (*)' (3.57, 

where, as before, z = atp' /H. 

The reason why we used the potential $ instead of the variable <f> in the 
quantization procedure is that the action in terms of tf> is much more compli- 
cated, and thus the quantization much more involved when written in terms of 
that variable. 



The equation obeyed by 4> k have been derived before, see Eq. (2.72), 
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fc u k 



(VfT 

l/z 



Uk =0 



(3.58) 



where the variable Uk = afik/p'o, introduced in Eq. (2.73), is obviously related 
to by 



l_z /0 fc 

fc 2 2 I z 



u k = -To ~ — 



(3.59) 



We will use now explicit solutions to Uk in the short and long wavelength 
limits, and subject them to the constraints (3.54) and (3.55), found through the 
quantization of the $-modes. First, in the short wavelength limit of Eqs. (3.58), 
fc 2 3> (l/z)" /(l/z), there are clearly oscillating solutions with frequency k. 
We impose on these solutions the constraints found upon quantization, (3.54), 
which, using the relations above, are 



Uk(m 



1 N(k Vi ) 



1 

fc 5 / 2 



z(rji) fc 5 / 2 " ^ ^ 



(3.60) 



M(k m ) 



z(rji) V fc3/ 



z(rji) fc 5 / 2 



M(krn 



(3.61) 

The solutions in the high frequency, small wavelength limit are then written as 



Uk(v) = Uk(Vi) cos [k(r] - 77,)] 



u' k (rn) 



sin [k(r] - rji)] 



(3.62) 



The low frequency, long wavelength solutions [when fc 2 <C (l/z)"/(l/z) ] are 

1 



Uk 



A k - d-qz 



(3.63) 



but we can use the background equations inside the integral, integrate by parts 
and the final result is 



u k (rj) = A k \ ( - f d V a 2 ) . (3.64) 

So far the constants Ak (which, as we discussed before, determine the spec- 
trum of the long wavelength perturbations) are undetermined. But the spec- 
trum Uk(rji) for small wavelengths has been fixed by quantization. Therefore, 
what we have to do to determine the AkS is follow a short wavelength mode 
with physical wavelength = 2ixa/k until it reaches the limiting wavelength 
27rafc~ 1 ~ yj (l/z)/(l/z)", and then use the junction conditions to fix the am- 
plitude of that mode for all later times. 
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A rigorous treatment would forcibly involve the approximate solutions when 
k 2 <~ (l/z)"/(l/z), which are very difficult to obtain. There is a much simpler 
way, though, which involves taking into account the relation between the and 
■Life given in Eq. ( |3.59| ). We know that for the modes the scale above which 
the fluctuations are high frequency is — Z " I z. For the uu modes, though, 
that scale is k 2 = (l/z)"/(l/z). Therefore, if these scales overlap over a wide 



enough b and, both the oscillatory solutions ( 3.62 ) and the large- wavelength 



solutions (|3.64| ) must be good approximations to the exact solution. During an 



inflationary period, this interval is indeed large, in fact 



1 \ " z 'l m 2 

-J -mVfo) « -^H 2 ( V )^—vl(ri)a 2 (ri) , (3.65) 

since <po 1 during inflation. Therefore, for scales such that ma(r]) < k < H(rj), 
both solutions are approximately valid. We can then express the approximate 
solutions in this interval as 



u k {v) 



UkiVi) cos (kiji) - [u' k {rii)/k] sin (krji) 1 / 1 



{(l/^ )[(l/a)Jd W 2 ]% n(k) <pf \a 



dr]a' 



(3.66) 



where the junction time rjn{k) represents the time when the scale k — T-C(jfu(k)) 
crosses the Hubble radius, that is, when the physical wavelength of the mode k, 
X p h = 2ira(r])/k, become s big ger th an th e horizon H^ 1 = aH—1. 

Therefore, formulas ( |3.66 ) and ( 3.62|) are, respectively, valid for long wave- 
length and short wavelength perturbations in an inflationary universe^. We will 
use these formulas to calculate the amplitude in each mode of the field <j>k, for 
fc's representing both perturbations bigger and smaller than the horizon. As 
noted after Eq. ( 3.5S| ), the modes 4>k can be expressed in terms of the modes 
u k by 



<t>k(v) 



ro 



u k (ri) 



(3.67) 



For the simple case when the inflaton potential is V — m 2 ip 2 /2, the expres- 
sions for long and short waves simplify substantially, and after we take into 
accou nt the approximate solutions for the scale factor in ( 3.25 ), approximation 
( [j.2l| ) and the useful indentity TL ~ — 2(p' Q (po the result is 



|^|( t ) = fc -3/2 



m 



1 



2\76tt 



x 



1 



log 



{.H(t)a{ 



(3.68) 



fl(t)-fl{t T ) \H(t)a(t) i 

4 In fact, the formula for the long wavelength perturbations is valid for perturbations 
outside the horizon even after inflation has ended (see |13| for a detailed analysis). 
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with k > H(t)a(t) in the former case and Hia{U) < k < H(t)a(t) in the 
latter. In this expression, t, is the time when inflation started, and t r is the 
time when it ends. To the reader concerned with units, we note that had we 
retained units in the expression above, we would get m/M p i (or \/Gm) instead 
of m. We also notice that we have retrieved the result of Eq. fl3.34| ), with a log- 
arithmic correction that comes from the fact that the Hubble parameter is only 
approximately constant during inflation, such that the Hawking temperature at 
the beginning of inflation is bigger than that at the end. 

We can also calculate the amplitude of perturbations after inflation is over. 
Assuming that the universe behaves essentially as if it was dominated by radi- 
ation, the scale factor has the time dependence a oc t 1 ^ 2 (this is a good approx- 
imation because during the brief period of reheating, a oc t 2 ^). The result is 
very simple, 



4>k(t > U) 



fc- 3 / 2 mlog(^) 



(3.69) 



where X p h — 2ira(t)/k is the physical wavelength of the perturbation with co- 
moving wavenumber k, and A 7 is the characteristic wavelength of the cosmic 
background radiation. 

In observational cosmology the object of interest is the power spectrum, 
which, as mentioned at the beginning of this section, is defined as the adimcn- 
sional measure of a classical fluctuation moded out by a volume factor, e.g., 



fc 3/2 \A 



(3.70) 



and 



= k 3 / 2 



(3.71) 



From Eqs. ( 3.68 ) and ( 3.69 ) we see that the power spectrum of the scalar 
perturbations created during inflation is nearly scale invariant, with only log- 
arithmic corrections that are not very important on scales that are relevant 
cosmologically. The power spectrum is also proportional to the square root of 



the self-coupling of the scalar field (vm 2 in our case of a massive inflaton). 
The fluctuations in the temperature seen in the CMBR are roughly propor- 
tional to the spectrum calculated above, and the amplitude measured is about 
ST/T ~ 5 x 10~ 6 . This implies that the mass of the inflaton has to be very 
small, m ~ 10~ 6 [M p ;], so that the perturbations created by inflation have the 
correct amplitude. That the mass of the inflaton happens to be so small is 
referred to as the "fine-tuning problem" . 

Summarizing, we saw that inflation provides a causal mechanism that gen- 
erates perturbations with near identical amplitudes on cosmological scales. The 
power spectrum of perturbations is nearly scale-invariant, and is proportional 
to the mass of the inflaton. These perturbations are generated continuously 
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during inflation, inside the causal Hubble horizon. They leave during inflation 
(their physical wavelengths become bigger than the horizon) and reenter later, 
after inflation ends. 



3.4 Global structure of Chaotic Inflation 

We have mentioned in Section 3.2 that, above a certain scale, large quantum 
fluctuations of the scalar field might give rise to a process that generates new 
inflating bubbles 0. We will see later that the back reaction of scalar pertur- 
bations created during inflation is important on scales where the process of 
self-reproduction is active. In this section we explain how the chaotic inflation- 
ary model can regenerate itself, and investigate some features of the universe 
on very large scales g. 

ft was shown in the last section that small wavelength quantum fluctuations 
originated inside the Hubble horizon have their amplitude magnified, and their 
wavelength redshifted, as inflation proceeds. If perturbations have a small am- 
plitude, they don't have a noticeable effect on the evolution of bubbles. On 
the other hand, if a perturbation in the energy density inside an initial bubble 
is so large that after it leaves the horizon the background energy density in 
"new" bubbles is affected, then the perturbations have to be accounted for in 
the dynamics of these bubbles p6|. 

We will show below that above a certain energy scale (the so-called "self- 
reproducing scale"), quantum fluctuations of the scalar field are so large that 
they are dominant in the dynamics of the bubbles, and actually drive the scalar 
field up its potential hill in most regions of the universe p7|. Since the expansion 
rate is proportional to the potential energy of the scalar field, regions in which 
the scalar field is higher up its potential expand faster, and as a result these 
bubbles dominate the ultra-large scale structure of the universe. A bubble in 
which the scalar field happened to fall below the self-reproducing scale (called 
also a "mini- universe" ) evolves classically, in the sense that the classical equation 



of motion for the scalar field (2.15) is obeyed, without quantum corrections. The 
expansion rates of these classical mini-universes are much slower than elsewhere, 
therefore they occupy an ever decreasing volume fraction in the universe. 

In bubbles where the energy density is above the self-reproducing scale, 
quantum fluctuations drive the scalar field both up and down the potential. 
As a result, after a few e-folds an initially homogeneous bubble would turn 
into a rather inhomogeneous collection of bubbles, some with higher, some with 
lower energy density (see Fig. 3.2). From a homogeneous bubble, after a few 
e-folds we would get many different bubbles with different energy densities, 
expansion rates etc. In particular, the process could drive the energy density in 

5 As we have indicated before, a bubble is a horizon-sized spherical domain in the expo- 
nentially expanding universe. After one e-fold, there are e 3 "new" bubbles inside the initial 
bubble. 
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Figure 3.2: A region of constant comoving volume with an average value of the scalar 
field if> > ip sr ("initially homogeneous bubble"). Dark bubbles correspond to domains 
where the scalar field fluctuated to a value bigger than (p, and in the light bubbles it 
fluctuated down to ip < ip sr . The dark bubbles occupy an exponentially high comoving 
volume of the initial bubble, since their Hubble parameter is bigger than the Hubble 
parameter of the initial bubble. 

some regions to such high values that the symmetries of the fundamental Grand 
Unifyed Theory would be restored. If a domain happens to cool down and come 
out of this symmetry-restored region to become a FRW universe, the physical 
properties of this "baby universe" could be radically different from ours, since 
its GUT vacuum is not necessarily the same as ours. Hence the names "self- 
reproduction" and "mini-universes": according to the chaotic scenario, there is 
a continuous creation of inflating bubbles, each of which might have completely 
different physics. 

In all fairness, the global structure of the chaotic scenario bears no immediate 
concerns for astronomers preoccupied with the observable universe, since all of 
it would have come from an entirely classical region some 60 e-folds before 
reheating. However, it is crucial when we consider, for example, the nature of 
the Big Bang singularity, the initial conditions for the universe and its topology, 
and as a means to justify the anthropic principle in inflationary cosmology. 

A rigorous analysis of the global structure involves the diffusion equation for 
the probability distribution of the value of the scalar field, P((p,t). The final 
result, though, is the same as the one obtained by a much simpler argument 
explained below ||. 

A classical description of spacetime is only possible when the energy density 
of the universe has fallen below p ~ V(<p) ~ 1[M^]. The value of the scalar 
field, in the massive case, is by assumption smaller than <p < ip p i — m , with 
the mass constrained by the observed CMB anisotropics to be m ^< I0~ 6 . A 
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microscopic domain of the size of a Hubble radius H pl 1 ~ l p i = 10 44 cm would 
have inflated to a radius today of about 

R(t ) ~ l pl e~^ 2 ^-^ ~ e 10l °cm , (3.72) 

quite a few orders of magnitude bigger than the observable universe today. 
Notice that the estimate above is actually a lower bound, since we know that 
quantum fluctuations should make most regions inside the initial planck-sizc 
bubble inflate for much longer than that. 

The generation of long wavelength perturbations is completely analogous to 
the mechanism described in the previous chapter: quantum perturbations of 
the scalar field arc rcdshiftcd, while their amplitude is approximately frozen 
once they become larger than the horizon. The difference is the amplitude of 
the quantum fluctuations in a region of the universe in which the scalar field is 
above the self-reproducing scale. As we have noted before, quantum fluctuations 
during inflation have a power spectrum given by 

|A^|~^ , (3.73) 

which can be interpreted as meaning that the scalar field takes a step of size 
H/2ir every time lapse At = H~ l . 

At the same time, the approximate equation of motion during inflation, Eq. 



(3. IE), implies that after the same time lapse H 1 the change in the scalar field 



from the force V v will be 



A cl <p~¥*-H- 1 2i- ■ (3.74) 
V <p 

By comparing the two values we see that if we require that quantum fluctuations 
be unimportant, then 

ip <c ifsr = Tn~^ , (3.75) 

and the converse if the classical motion of the scalar field is negligible when 
compared to the quantum fluctuations. Notice that even though the quantum 
fluctuations are important in this limit, we still have |<5y;/y;| <C 1 so that the 
linear perturbative approach is still valid. 

The behavior of a domain with ip < ip sr is very simple, and is given by the 
familiar picture of a scalar field rolling down the hill of the potential. Regions 
in which ip > ip sr are much more interesting, since they can show a multitude 
of distinct behaviors, including mini-universes with radically different physical 
properties. The continuous creation of inflating bubbles from other inflating 
bubbles means that spacetime has a fractal structure, maybe even a self-similar 
one. Given the virtually infinite variety of mini-universes that can be created, 
the chaotic scenario implies that some universes must be capable of supporting 
life. 
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The main result from this section is Eq. (3.75): above the self-reproducing 
scale, quantum fluctuations dominate the evolution of the scalar field. In a 
mini-universe like ours, the value of the scalar field at the beginning of inflation 
was ip sr — m -1 / 2 , and it rolled down from there. The total size of our mini- 
universe is then roughly Lq ~ 10 10 cm, still a number for which units are quite 
meaningless. 

The main point of this monograph is to show that the influence of the quan- 
tum creation of perturbations is not restricted to the effect to linear order caused 
by fluctuations in the self-reproducing epoch. In the next chapter we discuss 
how cosmological perturbations for which the linear effect is negligible can still 
have a decisive effect on the dynamics of the background. 
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Chapter 4 



The Effective 

Energy-momentum Tensor 



Gravitational waves curve the space in which they propagate. But how can the 
gravitational field be at the same time the cause and effect of the gravitational 
field? What is the difference between the gravitons that make up a gravity 
wave and the gravitons that constitute a background metric in which the gravity 
waves propagate? How do they interact? 

Due to the nonlinearities of Einstein's equations, the notions of gravitational 
fields and sources are not as markedly distinct as in other field theories like 
electromagnetism. Indeed, if gravity can be described by anything that looks 
like a quantum field theory, then in some classical limit of this theory there 
is a spin-2 massless particle, the graviton. The propagation of gravitons in 
flat spacetime can be described in terms of transversal plane wave solutions 
to the classical (usually Einstein's) theory. These waves, parametrized by a 
wavenumber k, clearly carry some energy and momentum proportional to k, 
and as such possess a "gravitating mass" , in analogy with non-abelian Gauge 
Theories where the field strength can carry color charge. 

The situation is more complicated when the background in which the gravi- 
tons are propagating is a curved one. Nevertheless, in the so-called Geometrical 
Optics limit, when the wavelength of the graviton is much smaller than the cur- 
vature radius, the former result is still valid: gravitational waves have energy 
and momentum and, therefore, are themselves capable of curving spacetime. 
The back reaction from gravitons in the high frequency limit has been explored 



in many applications, most notably with respect to nucleosynthesis |28| . There, 



an excess of gravity waves can speed up the cooling rate of the universe and 
increase the fraction of helium, leading to constraints on the energy density of 
gravity waves at the time of nucleosynthesis from the observation of the abun- 
dances of primordial elements. 
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These ideas are old, and have been pushed forward since the late 50 's and 
60's, by the work of Wheeler ||, Brill and Hartk@, Brill[§l|, and culmi- 
nating with an important result by Isaacson Jl]], who showed that the energy- 
momentum tensor of small wavelength and high frequency (HF) gravitational 
waves is "gauge invariant" , i.e., has the same value in any reference frame. This 
result was crucial in that it put to rest any fears that the energy density and 
momentum of gravitational perturbations are a gauge effect. 

Some questions, though, remain open, namely, what happens outside the 
limit of HF perturbations? How to apply the results to cosmology and, in 
particular, to inflationary cosmology? 

4.1 Historical remarks 

The first person to think of using gravitational waves as a means to curve a 
homogeneous spacetime was Wheeler ^| . Consider geometries in which a back- 
ground changes smoothly, due to the presence of perturbations with high fre- 
quency but small amplitude. Wheeler showed that, as long as the perturbations 
had small amplitudes, the metric could be regarded as a smooth background 
{luv) with small ripples (h^), and that the Eintein's equations made sense 
perturbatively, even when the perturbations were HF. 

In the absence of matter, the Einstein's equations for a background metric 
with perturbations can be expanded in the usual Taylor series, 

G^ = G$ + G« + G$ + ■ ■ ■ = , (4.1) 

where the superscripts indicate the order of the term in the perturbations. As- 
sume that the dynamics of the background is determined by the superposition 
of gravity waves h^ v , but somewhat independent of the dynamics of one given 
mode (wave). In that case, Einstein's equations decouple and read 



G$[h] = , (4.2) 
= T$=-G®[h*\ , (4.3) 

where now it is implied that the first equation is to be solved for each mode sepa- 
rately, and that the RHS term on the second equation involves the superposition 
of all the wave modes (the O'^-order term does not involve any perturbations 
at all). Both G$[h] and G$[h 2 ] are evaluated at the background 7^, which 
means simply that both the propagation of the linear modes and their quadratic 
interactions are mediated by the background spacetime. 



We could interpret the RHS of (4.3) as a source, a second-order effective 



energy- momentum tensor r^v for the gravity waves in the background 7^, 
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although it is in reality nothing but the second-order term of the Taylor ex- 
pansion of G^ v . In this form, it is also known as Landau's pseudotensor of 
energy-momentum of gravity^). 

(2) I ^ 

This effective energy-momentum tensor (EEMT) is conserved, G^i — 0, 
as it should be given that the Bianchi identities are geometric identities for 
the exact metric is the covariant derivative with respect to the background 
metric). The nonlinearities here are evident: the source of the gravitational 
field is a conserved current which is a function of the gravitational field itself. 
In Maxwell's equations, the source is a conserved, charged current, but the 
electric and magnetic fields do not carry any charge. The gravitational field, on 
the other hand, has a "gravitational charge", namely, mass. 

Some comments are in order at this point. First, the typical examples 
of backgrounds that we should have in mind are Friedman-Robertson- Walker 
spacetimes (homogeneous space) and black hole spacetimes (S 2 symmetry). Per- 
turbations around these spacetimes can only be defined with the help of an av- 
erage with respect to the symmetric background coordinates [a; 3 for cosmology, 
angles (9, 4>) for black holes] in a given patch of the spacetime. The average of 
a perturbation Sf should yield zero by definition, otherwise we could re-define 
the background and perturbation as fo^fo + (Sf) and Sf — » Sf — (Sf), where 
() denotes the average, 

tSf\ = / V $V d 3 x5f{x,t) forcosmology , . 

I -tl d6d</)6f[(6,(j)),{r,t)] for black holes. { ' 

Second, all perturbations Sf in these symmetric spacetimes can be decom- 
posed into modes, which are conventionally chosen to be orthonormalized so- 
lutions of Laplace's equations in the background of choice. In flat FRW the 
modes are simple plane waves e ±lkx labeled by the wavenumber k, while in 
Schwarzschild the modes are spherical harmonics Yi m (9,(f>) labeled by the an- 
gular momentum numbers (I, m). We can use the orthonormality conditions to 
simplify integrals of quadratic combinations of the perturbations and turn them 
into sums over the labels, so for example in cosmology we have 



(Sf(x)5g(x))(t) = ^ J d 3 xSf(x)Sg(x) = J d 3 k Sf k (t) Sg k (t) . (4.5) 
In the case of black holes, we would have a sum over the numbers (I, m) in 



the RHS of (4.5) instead of the integral (sum) over k. The conclusion is that 
the averaging of quadratic combinations of the perturbations is equal to a sum 
over the modes of the perturbations. A superposition of modes is, therefore, 
the phase space equivalent of a spatial average. In cosmology, given that the 
amplitudes of the modes have a Gaussian distribution, the sum could also be 
interpreted as an ensemble average. 
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This suggest a rigorous way of deriving the system of equations accounting 
for back reaction: we start with the Taylor expansion of Einstein's tensor, Eq. 



(4.1), and perform an average. The term (G^J) vanishes, since it is linear in 



the perturbations, and we are left with 

G$ = -<<$)) , (4.6) 

where, of course, (G\lv>} - — G\iv since the background metric depends only on 
the variable t, in the case of cosmology, and on r and t in the case of black holes. 
The EEMT can then be interpreted in either one of two equivalent ways: 1) it 
is the correction to the homogeneous component coming from inhomogeneities, 
or 2) it is the sum of the stresses and energies in each momentum mode. The 
remaining equation for the perturbations, G^j = 0, follows naturally from fl4.l|), 
since it decouples from the other terms and thus must be zero by itself. 

The first explicit construction of a consistent solutions to Einstein's equa- 
tions in the form above was the gravitational geon, due to Brill and Hartlejjo|Q. 
They were looking for stable, spherically symmetric superpositions of small am- 
plitude gravitational waves which were held together by their own gravitational 
attraction. 

Clearly, if this configuration is to be stable and nondispersive, the back- 
ground must be highly curved, and therefore it is necessary to demand a lot of 
energy in the gravity waves, that is, their frequency u> cx E must be extremely 
high. 

Although the gravitational geon model ended up depending on an unrealistic 
limitQ, they found that the gravity waves carried a mass and produced an ex- 
terior Schwarzschild spacetime outside of the geon. That work established also 
that the problem of propagation of gravitational waves in curved spacetimes 
need not be treated through the full nonlinear Einstein equations, at least as 



far as the system (4.2)-(4.3) is concerned. 

At the same time Brill [[3l[ completed an investigation of gravitational waves 
in closed universes, and he concluded that gravitational waves in an empty, 
closed universe would cause this universe to collapse. Furthermore, the time 
development of a universe filled with HF gravity waves is identical to that of 
a universe filled with radiation, that is, p r = p r /3. This was further evidence 
that gravitons, at least in the HF (or geometrical optics) limit, are just massless 
particles with 2 transversal degrees of freedom, like the photon. 

After the ADM formalism Q was developed in 1964, a series of papers []33| 
found that the mass-energy of any nonsingular gravitational wave configuration 

1 Wheeler, who s ugg ested the problem, had for some time been constructing electromagnetic 
and neutrino geons[p2| , with the intent of examining the problem of gravitational collapse in 
black hole physics. 

2 The gravity waves had to be all propagating over an infinitely thin spherical shell, and 
their wavelength had to be infinitely small compared to the radius of the shell. 
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that is bounded by flat spacetime is positive definite (this result was later gener- 
alized to any asymptotically flat gravitating systems). Interestingly in his 1964 
paper Brill also formulated situations in which the interaction energy between 
distinct bundles of gravity waves is negative. As we will se in later chapters, the 
energy density in some cosmological perturbations corresponds to interaction 
energies, and have a negative sign as well. 

4.2 The geometrical optics limit and gauge in- 
variance of the EEMT 

One of the difficulties that arose during those years is the gauge problem^: 
small transformations of the coordinate system induce gauge transformations 



of the metric perturbations - see Eq. ( 2.32 ) - and since the EEMT is defined 



in terms of the metric perturbations, it should then also change under a gauge 
transformation, r jiv \h?\ — > f^ u [h 2 ] ^ T liv \h?\. Therefore a change in the coor- 
dinate frame would induce a change in the EEMT. This implies, e.g., that the 
mass of the geon as calculated by Brill and Hartle would not have an invariant 
meaning, assuming different values in the different frames used to describe the 
gravitational waves. 

In 1968 Isaacsonjl] (see also |35|, |37], |38| ) came up with the answer for 
this important question: he showed that in the limit of HF gravitational waves, 
the effective energy-momentum tensor averaged over space and time is gauge 
invariant. In other words, assume that the gravity waves are HF, and calculate 

(2) 

their effective energy-momentum tensor, G^J. The wavelengths and periods of 
the waves, I, are much smaller than any corresponding background scales L by 
assumption, so we can proceed to averaging over the space and time coordinates 
on scales bigger than I but smaller than L (see Fig. 4.1). Isaacson showed that 
the result of this calculation is independent of coordinatization, under a few 
weak conditions |jl|, E35l |36], [37|. 

We will now provide a new proof of this theorem, which does not rely on 
the extensive algebraic manipulations needed in previous proofs. In order to 
do that we introduce some formalism borrowed from functional analysis by De 
Witt |39| that greatly simplifies this and other calculations (see Appendix A) . To 
facilitate the discussion we also drop tensorial indices whenever unambiguous, 
so Gfj, v -> G, g^ -> g etc. 

We will be interpreting the functions w(x) — w[d/dx,g(x)] (Einstein ten- 
sor or Riemann curvature, for example) defined on the manifold A4 as the 
parametrized set of functionals w x defined on the space of functions g{x') ac- 
cording to the formula 

3 See [M for a brief discussion. 
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Figure 4.1: A background spacetime with perturbations superimposed. In the ge- 
ometrical optics limit, the wavelength I of perturbations is much smaller than the 
curvature radius of the background, L. 



w x = w[d/dx, g(x)] = / w[d/dx' , g(x')]5(x — x')dx' , 



(4.7) 



where 5{x — x') is the Dirac delta function. Then the functional derivative 
Sw x /Sg(x') can be defined in the standard way 



Sw x = 



Sw x 



—5g(x')dx' 



(4.8) 



Sg(x') 

where Sw x is the change of the functional w x under an infinitesimal variation of 
g{x'): g{x')^g{x') + 6g{x'). 

If, for instance, w x = g{x), then 



5w x = 6g(x) = / S(x — x')5g{x')dx' , 



and comparing this formula with (4.8) we deduce that 

5w,, 



Sg(x') 



5{x — x ) . 



(4.9) 



(4.10) 



As another example, consider w x = d 2 g(x) / dx 2 . Using the definitions (4.7) and 
(O) one gets 



5w„ 



d 2 



Sg(x') dx' 2 



S(x - x') 



(4.11) 



In the following, the functional derivative F xx > — 5G X / 8g(x') will be treated 
as an operator which acts on the function /(a/) according to the rule 
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F xx .-f{x') = / F xx ,f{x')dx' (4.12) 



We will also use DeWitt's condensed notation [|39| and assume that continuous 
variables (t,x l ) are included in the indexes, e.g., A a (x l ,t) — A^'^^ = A m , 
where m is used as the collective variable ("field index") to denote (a, x l ,t). In 
addition we adopt as a natural extension of the Einstein summation rule that 
"summation" over repeated indexes also includes integration over appropriate 
continuous variables, e.g., 

A m B m = A^' x ^B {a ^ t) = I A a (x,t)B a (x,t)dxdt (4.13) 

a J 

We shall write functional derivatives using the following short hand notation: 

6^) = S^ = W > m ' (414) 

For instance, the formula that we are interested in for the proof of gauge invari- 
ance of the EEMT of HF gravity waves, 

C,w{x) = J M^jQctgtf) , (4.15) 
in condensed notation takes the form 



Ctw = w, m ■ {C i9 ) m . (4.16) 

We will se later (Chapter 5) that this formula is a trivial consequence of the 
diffeomorphism invariance of the theory. It expresses the geometrical fact that 
the infinitesimal transformation induced on a tensor functional w[g] by a gauge 
transformation can always be expressed in terms of the infinitesimal change in 
the functions g. 

As further examples of the formalism, the Taylor expansion of the Einstein 
tensor in the metric perturbations can be written in this formalism as simply 

G^[h] = G, m \^-h m (4.17) 

and 

G&[h 2 ] = \ G, mn \^h m h n . (4.18) 

We now proceed to prove that the EEMT of HF gravity waves is gauge 
invariant. First, we note that 4-divergences can be integrated out under a 
space-time average, i.e., they do not contribute to the effective energy and stress 
(t^iv) of the gravity waves [Q. For cosmological perturbations which are small 
wavelength but low frequency the average in time cannot be done, but the 
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average in space guarantees that we can still integrate by parts expressions 
containing spatial derivatives. 

We will now show that under a gauge transformation that takes the metric 
perturbations h — » h — (£^7) [see Eq. ( 2.32] )] the corrections to the EMT are 
of the form of a total derivative and thus, in the HF limit, can be dropped in 
expressions containing the averaged (effective) EMT. 



For the metric g = 7+/1, the EEMT is given by Eq. ( |4. 18| ) . If it is the gauge- 
transformed metric g = 7 + h — (£57) that we are considering, the expression 
is now simply 



1 



(4.19) 

Using the identity (4.16) we can integrate by parts the ^-dependent correction 
terms and get 



= 2* 



\G, 



-A ' [G, 



•(A-7) m 

• h m ] + G, 



2 " 

(£ £ • hy 



(4.20) 
(4.21) 



Now the expression between square brackets in the first term is just C^G[j], 
which vanishes by the equations of motion for the background G[y] = 0. The 
expression inside the square brackets in the third term is just the equations of 
motion for the perturbations h, therefore it also vanishes. We are left then with 
the expression 



- r = G, 



{Cfh) m ~ l -{c r CriY 



(4.22) 



Now all that is left is to show that this is a total derivative. This is easy to 
do if we notice that G^ u , m h m = R^ , m h rn - l/2j fll/ R, m h m - l/2hp V R[i\, but 
for gravity waves in vacuum R[y] — R ym h m — 0, and therefore the Einstein 
tensor reduces to the Ricci tensor. Now, the Ricci tensor for a general metric 
perturbation Sg is given by 



R^v , m Sg m = DSg„ v + 5g a a ^ v - 5g a ^ va - Sg1 lfia 



(4.23) 
(4.24) 



that is, a 4-divergence. Therefore, from Eq. fl4.22p we obtain for the metric 
perturbation Sg = £5 • (h — £5 • 7) that 



j. I V 



>= 



(4.25) 

where -C ■ • • 3> denotes an average over both space and time, over scales bigger 
than the wavelengths and periods of the HF gravity waves. This completes the 
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proof of the gauge invariance of the effective EMT of high frequency gravita- 
tional perturbations. Notice that the proof is only valid for HF perturbations 
because we used integration by parts in time, which is not justified when the 
perturbations are low frequency. We also remark that if the EMT of some mat- 
ter fields f m is such that T[/] i m 8f m can also be expressed as a total derivative, 
then the proof above is valid for a non-vacuum spacetime with that matter. 

The physical content of the proof is simple: HF gravitational waves corre- 
spond to highly localized gravitons that do not "feel" the background curvature, 
and therefore their short-time dynamics is essentially equivalent to that of gravi- 
tons propagating in a flat background spacetime. It is not surprising then that 
in this limit, called the geometrical optics limit, the energy and momenta of 
the gravitons have an invariant meaning in the class of reference systems that 
describe the background. Only in cases where coordinate transformations can 
actually fuzzy the localizability of the gravitons, or the notion of their energy, 
we are going to find difficulties with the interpretation of the EEMT. 

A crucial drawback of this proof of gauge invariance of the EEMT is that 
it does not apply when matter is present, being valid only for vacuum space- 
times with gravitational waves. Scalar field and metric perturbations are then 
excluded from the proof, although they are the driving force behind structure 
formation and actually contribute much more than gravitational waves to the 
energy density of the universe. 

Besides this weakness, there is at least one situation of interest in which the 
geometrical optics limit fails to apply: cosmology. As we saw in the theory of 
cosmological perturbations, there are physically relevant perturbations with a 
gauge invariant meaning on scales bigger than the horizon, i.e., with wavelengths 
bigger than the curvature radius H~ x . The amplitude of these perturbations 
is virtually frozen, which means that their period is also much bigger than the 
time scale of the background, if -1 . 

The reason why we want to study the back reaction from perturbations 
which are bigger than the horizon during inflation is that the scales outside 
the horizon during inflation actually correspond to the present scales of the 
apparent horizon of the universe and beyond. As we saw earlier, the Hubble 
radius when there were 50 e-folds left before the end of inflation correspond to 
the present apparent horizon. In other words, the region of the universe that 
emitted the CMBR that we measure today originated all from inside a bubble 
of physical radius H (t^o) at time £50, expanded to become a region of radius 
H— l(£so) x e 50 containing some e 3x50 Hubble-size volumes, then after inflation 
ended the horizon increased to eventually encompass all these e 150 regions. If 
we focus on a bubble at a time tb < £50, we would be looking later at a region 
of spacetime which is some orders of magnitude bigger than the present radius 
of the observable universe. 

When we set out to solve the equations for the background in one of these 
bubbles, we simply assume the fact that the description in terms of a scale fac- 
tor and a homogeneous background scalar field is valid inside the entire bubble 
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Figure 4.2: After a time corresponding to n e-folds of the scale factor, a bubble of 
initial radius R b (t b ) = H' 1 ^) has inflated to the size R b (t) ~ H' 1 {t b )e n(t) . The 
small circles correspond to horizon-sized spheres of radii H^ 1 ^). Perturbations that 
exited the horizon at a time tn k have a wavelength I much smaller than R b (t) but 
much bigger than the Hubble radius H~ (t). 

of radius R b {t) = H~ 1 {t b )e t ~ tb , for all future times t > t b . We wish to go 
beyond this calculation, and take into account the contributions from all per- 
turbations inside the bubble. Whatever goes on outside the inflating bubble 
is of exponentially small consequence to processes inside the bubble, since the 
rate of expansion of the bubble creates physical space at a rate that quickly 
obliterates any outside processes and restrict their effects to the region close to 
the boundary Rb(t) of the bubble. We say then that the physical radius of the 
bubble, Rb{t), is also a particle horizon for that bubble. 

The perturbations whose wavelengths are smaller than we call small- 

wavelength and HF (high frequency) , while the ones which are bigger than that 
[but smaller than Rb{t)] are called long wavelength and low frequency (LF) 
(see Fig. 4.2). The problem of back reaction of HF perturbations should also 
include the back reaction of quantum fluctuations and the renormalization of 
the stress-energy tensor, which is are much more complicated issues that we do 
not consider here. On the other hand, the back reaction of LF perturbations 
can be treated as a completely classical problem (although their generation is 
an inherently quantum process), and this is in fact the subject of the present 
work. 
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The perturbations created during inflation have a wavelength much smaller 
than the particle horizon Rb(t) of the bubble, but the ones that we are interested 
in have been redshifted out of the Hubble horizon and thus have, obviously, a 
wavelength bigger than the Hubble horizon. When we talk later about the 
back reaction of scalar perturbations during inflation in the chaotic inflationary 
model, it is with respect to these LF, classical perturbations. 

A large class of cosmological perturbations are then not HF, which destroys 
the purpose of the integral over time in the space-time average. Without the 
integral over time, the proof of gauge invariance of the EEMT is invalid. Besides, 
when there is matter filling up the universe, there is again no guarantee that 
the EEMT is gauge invariant, even for HF perturbations. But without a simple 
gauge-invariant meaning to the EEMT, how can we make sense of the problem 
of back reaction? This will be the topic of the next chapter. Now we review 
some recent attempts at back reaction. 



4.3 The averaging problem 

The back reaction of perturbations on smooth backgrounds is a small set of a 
more generic class of problems, known as the "averaging" or "fitting" problem [[Io[ 
[OJ. Locally the universe is quite inhomogeneous, with the largest structures 
that we see on scales of 100 Mpc, compared to 3000 Mpc for the radius of the 
observable universe. We assume that if the inhomogeneities were smoothed out 
evenly throughout the whole universe, the time evolution of this homogeneous 
spacetime obtained by averaging would not be very different from the true, inho- 
mogeneous universe. However, nobody has ever provided an explicit covariant 
procedure whereby an inhomogeneous universe is mapped into a homogeneous 
one. 

The more basic problem is, whether and how a general relativistic system 
can be split at all into background and perturbations, and what dynamics the 
averaged fields follow. It can also be formulated as the loose statement that the 
averaged dynamics is different from the dynamics of the average. 

In the context of cosmology, the assumption is that on a sufficiently large 
scale the universe is isotropic and homogeneous, therefore on these scales the 
split into background and perturbations is justified. This is true as long as 
the density contrast dp/p and the metric perturbations (or, equivalently, the 
newtonian potential) are much smaller than one. However, since very early 
times in the history of the universe there are structures which have gone into the 
nonlinear regime, i.e., the perturbations on the region around those structures 
are bigger than one. Therefore, in cosmology the averaged variables used to 
describe homogeneous Friedmann universes are obtained by construction rather 
than by derivation. 

The averaging problem has been explored also in Newtonian cosmology p^|, 
where the situation is much simpler than in the relativistic case. For instance 
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the gauge freedom of the perturbations, which makes the analysis involved in the 
relativistic case, is substituted by the symmetry under a galilean transformation, 
x — » x + vt, where v is a constant velocity field. The dynamical variables of 
newtonian cosmology are the eulerian fluid variables: mass density pn{x, t) > 0, 
fluid velocity v(x, t) and gravitational acceleration g(x, t). The back reaction of 
perturbations in newtonian cosmology turns out to depend only on the rate of 
expansion V • v and rotation V x v of the fluid, which are clearly invariant under 
galilean transformations. 

Another interesting result from the newtonian theory is that for closed space- 
times the perturbations do not influence in the overall expansion [[f2|. For flat 
spacetimes, the perturbations can influence the expansion on arbitrarily large 
but finite scales, and the average motions are approximated well by a Fried- 
mann model on scales much larger than the scale on which perturbations have 
an effect on the expansion. 

In general relativity the problem of averaging is much more involved than 
in newtonian cosmology, because of the facts that 1) there is no preferred time- 
slicing, and 2) the metric itself is a dynamical variable, which should be averaged 
over. A solution to the averaging problem in general relativity would require 
an as yet unspecified "macroscopic description" (see, for example, Ref. [H), in 
analogy to the macroscopic formulation of Maxwell's electromagnetism inside 
dielectric media. 

Another suggestion that has been madeQ, [l5| is that a inhomogeneous 
spacetime manifold with initial Cauchy data {M., gij, K^) can be continously de- 
formed into a homogeneous spacetime supported by the manifold, (M., g~ij, Kij) 
(see Fig. 4.3). The procedure is similar to the renormalization group approach: 
the metric on a point of the inhomogeneous spacetime is rescaled according 
to the curvature at that point E^j. The end result of this flow is a homoge- 
neous metric and curvature, corresponding to a FRW universe^]. The procedure, 
though, seems too complicated to apply in any realistic situation 

Most recently the problem of back reaction has been addressed by a number 
of authors in connection with the effect of inhomogeneities on the expansion of 
the universe. 

Futamasej47| analysed the corrections to the expansion law of the universe 
using the ADM formalism. He did not solve the dynamical problem of back 
reaction, rather he focused on the question of local observables such as the ex- 
pansion rate of the universe. Futamase and Bildhauer|Q have looked into the 
problem of back reaction of hydrodynamic perturbations in dust-filled universes, 
and concluded that the influence of high frequency perturbations increases the 
expansion rate of the universe, but that this effect decays rapidly with time. 
In their treatment, they allow for nonlinear density perturbations, as long as 
the metric perturbations remain linear. Their results are reinforced by some 

The procedure is only applicable to closed spacetimes which are topologically S 3 / r - The 
dominant energy condition |p| < p must also be satisfied on the manifold. 
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Figure 4.3: A manifold (M,g,K) corresponding to an inhomogeneous spacetime 
can always be mapped by a continuous deformation onto another manifold which 
corresponds to a homogeneous spacetime (M,g,K). 



numerical work done by Shibata et aZ.j49| on the effect of axisymmetric grav- 
itational waves on a de Sitter background spacetime, which shows that unless 
the gravity waves form a black-hole singularity, they are redshifted and the de 
Sitter spacetime quickly becomes empty. 

Also related to the back reaction program is the subject of local observables 
in an inhomogeneous universe (5^]. One of the most interesting questions that 
have been addressed lately is the variations in local measurements of the Hubble 



constant, and their relation to a globally defined expansion parameter! 51 . This 



might be of crucial importance when comparing with the Hubble parameter 
which will be inferred from the maps of the CMBR anisotropy. 
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Chapter 5 

Finite Diffeomorphisms 



We saw in the last chapter that when the perturbations have a time period 
that is bigger than the expansion time -ff -1 , the effective (averaged) energy- 
momentum tensor (EEMT) is not gauge- invariant, that is, it assumes different 
values in different coordinate frames. This would put in jeopardy any attempts 
to tackle the back reaction of low frequency (LF) perturbations on the back- 
ground, because the problem itself would be ill-defined: which gauge should one 
choose to perform the back reaction calculations? 

The root of the problem, as it stood up to now, was that the second-order 
problem of back reaction was not formulated in a covariant manner up to second 
order. The perturbative expansion of the Einstein tensor was done to second 
order [see Eq. (4.6)], but the gauge transformations obeyed by objects in this 
perturbed spacetime were still regarded as first-order, linear gauge transforma- 
tions. 

Clearly, a consistent, covariant theory of second-order perturbations in- 
cludes gauge transformations that are not truncated at first order. In fact, 
we will be able to formulate the whole problem of a perturbative expansion of 
a diffcomorphism-invariant theory in an explicitly covariant manner (5^, |5^| . 

Calculating corrections to arbitrary orders is equivalent to regarding per- 
turbations as finite corrections, which can be improved in successive orders of 
magnitude in perturbation theory. The parameter e of the second chapter, which 
expresses the strength of the perturbations (e ~ Sp/p), is a finite number in any 
interesting perturbative theory. Furthermore, as we have argued in Chapter 2, e 
is indistinguishable from e, which is the small parameter that sets the size of the 
shift vector e£. Hence the parameter e = e that regulates the diffeomorphisms 
also have to be regarded as a finite number. This leads us to consider the theory 
of finite diffeomorphisms, an issue that was apparently ignored by most authors 
in General Relativity. The formalism can be applied to any relativistic system, 
and relies only on the assumption that there is an unambiguous background 
spacetime with perturbations. 
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5.1 Finite gauge transformations 



A diffeomorphism of a manifold is a transformation on the map Q x : P € M. — > 
x^ 6 5ft 4 of identification of the points of the manifold. We call it a gauge 
transformation \P(A) if the initial map Q x can be continuously deformedfj into 
the map Qx by taking the affine parameter A away from zero. In terms of the 
labels x and x we have 



P 



P 



I *(A) 



By the expression above we mean that the same point P of the manifold, with 
coordinate label ip in the original frame, has a label xp in the tilde (trans- 
formed) frame. 

We can construct explicit coordinate transformations for every point P of the 
manifold M. by considering a congruence of vectors x° — [A, X P (A)] parametrized 
by the affine parameter A in the following wayp9|. First, fix X /1 (P; A = 0) = x p 



and x M 0P;A = e) 



up, 



that is, set the small number A A = e as determining 



how "close" the two coordinate systems are along the curves x (see Fig. 5.1). 

The affine parameter defines a foliation of charts along x, and the tangent 
vector field along the hypersurfaces of constant A is given by 



dx^(P; A) 

dX 



£"(P;A) 



(5.1) 



Now transport the coordi nat e system x along the curve x from A = up to 
A = e, that is, solve Eq. (5.1) with initial conditions x^{P\ A = 0) = x p . The 
solution is given in terms of the Taylor series, 



x P - x ^P ; \ = e )= x ^ P; \ = o) + e -t- 



A=0 



1 



where in the second line we used the identity 



dX 2 



P:X=0 



dr^Jp..x =0 



i 2 av 
2 e ox 2 



A=0 



(5.2) 



(5.3) 



Equation ( |5.2| ) can be written in the compact form 

1 Transformations such as the one that takes a cartesian (x, y, z, t) system into a spherical 
(r, 8, </), t) one are obviously not included in this category. 
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Figure 5.1: Two charts, x and x, related by a diffeomorphism transformation gener- 
ated by the congruence of vectors \- Every point P of the manifold has a representation 
xp and xp in both charts, which are linked by the vectors \- To specify a gauge trans- 
formation is equivalent to specifying either the congruence \ a or its tangent vector £ M 
in the direction of the hypersurfaces of constant A. 



X a P {P-A = t) = {e e ^x a )p , (5.4) 
where we have included the small parameter e in the definition of £. Thus the 

general coordinate transformation x — > x on A4 generated by the vector field 
£ a (x) can be written as 

x a -^x a = ^{\ = e)-x a = e^^x a . (5.5) 

Conversely, given any two coordinate systems x a and x a on M. which are not 
too distant, we can find the vector field £, a (x) which generates the coordinate 
transformations x — > x in the sense ( |5.5|) . This class of coordinate transforma- 
tions is suitable for our purposes, although it does not cover the more generic 
family of "knight diffeomorphisms" of Bruni et al. Hjn. To our knowledge, the 
first to consider gauge transformations to second order was Taub]55|. 

The variables which describe physics on the manifold M. are tensor fields q. 
As discussed in section 2.4, under coordinate transformations x — > x the value 

2 The cited authors use a family of congruences Xm instead of a single congruence \ to 
generate the diffeomorphisms. Since the order in which the congruences are used to transform 
the charts does matter (at least in curved spacetime), the family of diffeomorphism generated 
in this manner appears to be more general than the class of diffeomorphism generated by a 
single congruence of vectors. 
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of q at the given point P of the manifold transforms according to Eq. (2.24), 



q(x P ) - q(x P ) = (£) p • • • (|f) fap) (5-6) 

Note that both sides in this expression refer to the same point of manifold 
which has different coordinate values in different coordinate frames, that is 
xp ^ xp. The question about the transformation law for the tensor field q 
can be formulated in a different way. Namely, given two different points V 
and V , which have the same coordinate values in different coordinate frames, 
that is x-p = xp, we could ask how to express the components of the tensor in 
the coordinate frame i at the point V (denoted by qp) in terms of q and its 
derivatives given in the frame x at the point V . The answer to this question 



is found with the help of the Lie derivative [see Eq. (2.25)] of the tensor q 



with respect to the vector field £ which generates the appropriate coordinate 



transformation x — > x according to (5.5) 



= x ) = [1 - <Q] • q(xp = x Q ) + 0(£ 2 ) . (5.7) 

We wish now to generalize this expression and write a formula for the exact (to 
all orders) diffeomorphism generated by a small but finite vector field £. 

It is useful to notice that the operator is the generator of the algebra of 
the group of diffeomorphisms, in the same manner as the angular momentum 
operators J are the generators of rotations in 3-D space. A quantum mechanical 
state \ip{0)) can be rotated by a finite angle AO through the action of the rotation 
operator e~ %J \ip(9)) = \ij)(6 + AO)). Similarly, the operator realising finite 
diffeomorphisms can be expressed as an exponential of the differential operator 
C^, where the parameter e = A A (included for simplicity in the vector field £) 
plays the role of the angle AO. This operator can be found in some advanced 
books on differential geometry^] as well as in some recent papers on the subject 
of finite diffeomorphisms [pi). Let us call this exponential of the Lie derivative 
the Lie differential operator, 



q(x) -> q(x) 



(e~ £ « 
q(x) - 



(C s •q)(x) + i(£ c .£ ? .g)( a ;) + 0(C 3 ) . (5.8) 



Equations ( p.7| ) and (5.8) are tensor equations, where for notational convenience, 
tensor indices have been omitted. Note that, in spite of the fact that the trans- 
formation law (^T^) is the consequence of transformat ion law ( |5.6| ), they are 
different in the following respect: transformation law (5.6) is well defined for 
any tensor given only at the point P, while (5.5) is defined only for tensor fields. 

With the Lie differential operator we have a simple prescription for the gauge 
transformation of all tensor fields of the manifold. It is valid, in particular, for 
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tensor fields which are functionals of other tensor fields, e.g. A flv [B](x) = 
B il {x)B u {x). Under a gauge transformation both A and B are transformed by 
the Lie operator, 

A — ► A = e~^A , 
B — ► B = e _£ «B . 

However, the tensor field A is the same functional of B as the tensor field A was 
a functional of B, therefore we conclude that 

A = e- c ^A[B] = A[e- c tB] . (5.9) 

Clearly, for this nontrivial identity to be true the Lie derivative has to obey all 
the properties of the usual partial derivatives when applied to tensor fields and 
functionals. For local tensors obtained by sum (A^ = B^ + C^), multiplica- 
tion {A^u = B^Cy), contraction (A^ = B™^) or any combination thereof, the 
composition rule (|5.9|) is a consequence of linearity and of Leibniz's rule, 



C^aA + fiB) = aC^A + pC^B , (5.10) 
£ S (AB) = (CsA)B + A{C^B) , (5.11) 

where a and f3 are constants and A and B tensor fields on the manifold. A 
key consequence of these properties is that the Lie operator obeys the same 
composition rule as differential operators, df(g) — ^dg. Therefore, all local 
tensor fields which are functionals of other tensor fields constructed in the ways 
described above follow the composition rule, 



C ( A[B](x) 



(5.12) 



(see section 4.2 for notational conventions). For the example A^ v = B^By this 
relation can be easily verified by using that 



8B a {x') 



S"6(x - x')B u {x) + B^(x)S^S(x - x') 



(5.13) 



where 5{x — x') is the 4-dimensional Dirac delta function. 

Now we will show that it follows from the composition law ( [5. 12 ) for the 
tensors built by sum, multiplication or contraction of other tensor fields, that 



the Lie operator obeys equation (5.9): 



e'^AlB] 



l-Ce 



:C e C, 



A[B] 
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A - A, m {C t B) a + -A <mn (£ s B) m (£ s B) n + -A <m (£py 



A[(l - C 6 + 



)B] 



Ale'^B] 



(5.14) 



where B can be any collection of tensors labeled by the indices a. The two last 
terms on the second line follow from the identity 



C^A^iC^Br] = A, mn (£ s B) m (£sB) n + A^C^BY 



(5.15) 



Notice that this last equality follows from the Leibniz rule, since C% ■ (A ■ C) = 
■ A) ■ C + A ■ (£^ ■ C) and C — £^ ■ B . Therefore the composition law to 
first order together with Leibniz's rule implies that the composition law to all 
orders, Eq. ( 5.14j) , is satisfied. Since th e com position law to first order is itself a 
consequence of Leibniz's rule [see Eq. (5.12)], we can say that everything hangs 
on the Leibniz rule: if it is valid for a tensor functional, then relation fl5.14 ) is 
also valid. 

We wish now to generalize this result for more generic tensor functionals, in 
particular the ones built with the help of covariant derivatives, e.g. A^ v = B^. v . 
In order to do that we need to verify Leibniz's rule for the covariant derivative, 
that is, we must show explicitly that the metric tensor transforms in such a way 
that the covariant derivative D a [d/dx, T^] acting on any tensor field transforms 
in the same way as a covariant vector B a multiplying that tensor field. 

As before, we only need to verify this to first order, since the result for higher 
orders follows from Leibniz's rule. This long calculation is done in Appendix B, 
and the end result, as expected, is that covariant derivatives indeed obey the 
rule, 



£, ■ {D[±,T\ ■ q) = D[JL,££\ ■ q + D[±, V] ■ (£, ■ q) 



dx ' 



dx ' 



(5.16) 



where (£^L)^„ = (£^T)" V + is the appropriate differential operator acting 
on the non-tensorial connection. This fundamental relation can also be extended 
to higher orders, and expressed as 



d_ 

dx' 



9nv 



= D a 



d_ 

dx 



(5.17) 



In sum: the Lie operator passes through the differential operator d/dx and 
acts only on the metric in expressions which involve a covariant derivative. Since 
we have proved the composition relation for tensors which are sums, multipli- 
cations, contractions and covariant derivatives of other tensor fields, the result 
is automatically proven for any combination of these operations. In this man- 
ner we cover all the objects which we will be interested in for the purposes of 
General Relativity and classical Field Theory. 
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In particular, the composition rule ( |5.17 ) implies that the Riemann tensor, 
defined as Rp^v 13 = [D^, D v ]v a for any vector v transforms in the same way. 
If we regard the Riemann tensor as a functional of the metric, we have that 



e~^{R%M^} = {e-^R)%M{e-^vf (5.18) 
= R a ^ v (e-^9)(e-^vf 

from whence we deduce that 

e - c tR[g] = R[e- C tg] , (5.19) 

a very nontrivial result for a nonlinear, differential functional of the metric such 
as the Riemann tensor. Of course, contractions of the Riemann tensor such as 
the Ricci curvature and the Ricci scalar clearly follow the same composition rule 
(for an explicit proof of the composition law for the Ricci tensor, see Appendix 
B.) 

It is clear now that the composition rule applies to all riemannian tensors, 
the Ricci tensor, the Einstein tensor, the stress-energy of a scalar field and so 
on. Indeed, any of the usual covariant expressions A which are expressed as a 
function of other tensors B follow the rule e~ c ^A[B] = A[e~ Cs B]. We will se 
in the next section that this rule is the base for a covariant formulation of the 
back reaction problem. 

5.2 Finite gauge transformations and general co- 
variance 



Let us consider now what are the implications of the conclusions of the last 
section to the problem of back reaction of perturbations on a given background 
spacetime. Let us first reformulate the problem, to include not only vacuum 
but also matter spacetimes. The Einstein field equations can be written in the 
forrnfj. 

GM-T^fatp] = IL^[g,(p] = , (5.20) 

for the exact metric <? M „ and the exact matter fields (p. Let us denote the matter 
and metric variables collectively as q m , the background variables by b m and the 
perturbations by p m (b would be, for example, the homogeneous scalar field 
ipo and the metric 7^ of Chapter 2, and p the scalar field perturbations Sip 
and metric perturbations Sg^, in the case of scalar field matter in a FRW 

3 It is a trivial but sometimes tedious task to verify that these equations can be written 
equivalently in the covariant, mixed or contravariant form. 
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background). In addition we include the second-order corrections since the 
beginning, and denote them by r m . We have then, with factors of e explicit for 
clarity, 

q m {x , t) = b m {t) + ep m (x, t) + eV" , (5.21) 

where we are keeping in sight that we ultimately wish to attack the problem 
of back reaction in cosmology, where the background variables depend only on 
the time coordinate. For the back reaction problem in black hole physics, the 
background variables would depend on the radius r from the singularity. The 
zero mode of the variables r m will be regarded as the back reaction corrections 
r™(t) to the background b rn (t). 



With the perturbative expansion ( 5.21 ) we can also expand the Einstein field 
equations, 

U[q m ] =n[b m }(t) + en mP m (x,t) + ^e 2 n mnP m p n + e 2 n m r m + 0(e a ) . (5.22) 
To lowest order, we have the background equations 

H[6] = G[b] - T[b] = G (0) - TW> = , (5.23) 

which in the case of cosmology can be solved for the homogeneous background 
metric (scale factor) and matter fields on hypersurfaces of constant time. 

The next order in perturbation theory is the equations of motion for the 
perturbations, 

n mP m = G, m p m -T mP m = G^-T^=0 , (5.24) 
which, again in the case of cosmology, gives the time dependence of the k- 



modes (see Chapter 2). It is important to notice that th e equ ations ( p.24| ) do 
depend on the background solution obtained from Eqs. ( [5.23 ). This is simply 



the statement that perturbations propagate differently in different background 
curvatures, which is obvious but should be stressed in this context. 

To second order in perturbation theory we have that the equations of motion 

are 



\n m nP m P n + H m r m = X - (G, mnP m P n - T mnP m p n ) 

+ G tm r m -T m r m =0 . (5.25) 

Notice that the terms between brackets are simply the EMT for perturbations 
defined in the previous chapter. Notice also that in second order in perturbation 
theory we require that both the th and l st -order dynamics have been solved, 
that is, that the background and linear perturbations are known to within a 
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level of accuracy of order e. This sequence can be carried to arbitrary orders 
in perturbation theory, with the caveat that at each order it is necessary to 
introduce additional degrees of freedom for the metric and matter fields. 

We will show now that these equations have a covariant meaning, that is, 



that a gauge transformation does not change the content of Eq. (5.22). Under 
a gauge transformation, the fields q transform as 

q m _> q m = (e- £ «q) m . (5.26) 

The Einstein equations in the coordinate frame x are the same tensors, written 
in terms of the metric and matter fields in that coordinate frame, 

n = n[ e - £ «q] = e - £ «n[<j] = o , (5.27) 

where in the last equality sign we have used the equations of motion written 
in the coordinate frame x, H[q] = 0. We also made use of the "composition 
rule" derived in the last chapter to pull out the Lie differential operator. The 
conclusion is: in both coordinate frames x and x, the content of the Einstein 
equations is the same for a given exact spacetime. In other words, we have 
verified explicitly the general covariance of Einstein's General Relativity and 
found simple formulas that relate the form of these equations in each coordinate 
frame. Since the same reasoning applies to any equations written in terms of 
tensors on the manifold, this result applies to any covariant theory with or 
without matter, such as higher derivative (R 2 ) theories of gravity, scalar-tensor 
theories, dilaton gravity etc. 

To make matters more clear, let us verify explicitly that, to second order, 



the Einstein equations (5.22) are a covariant set of equations. The gauge trans- 



formation of the field variables q is, expanding (p. 26]), 



{e-^q) m = [l-C 6 + -Cl) (b+p + r) m + 0(e 3 ) , (5.28) 



where we assume that £ and p carry a factor of e and r carries a factor of e 2 . 
Moreover, we assume that, for our purposes at least, the vector £ averages to 
zero (this will be explained shortly). Sorting the above equation by orders of e 
we have 



b m = b m (5.29) 

f™ = r m - (£ € p) m + ^(Cp) m . (5.31) 

Notice that the background metric and matter fields b(t) are unchanged by the 
gauge transformation (this is actually one of the basic assumptions - see Chapter 
2). The perturbations p(x, t) are transformed in the usual way, through the Lie 
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derivative of the background. Since one of the key assumptions is that the 
linear perturbations average to zero in all coordinate systems that we wish to 
consider (see, again, Chapter 2), it is clear that the vector £ also averages to 



zero, (£) = 0. The novelty comes from Eq. (5.31): the second-order variables r, 
which will be used to describe the feedback from the perturbations, are changed 
in a nontrivial manner that involves Lie derivatives of both the perturbation 
and the background variables. 

It is obvious that the zero and first order equations are covariant (see, e.g., 
Section 2.5). Let us see how the second-order equations of motion are rewritten. 
In the transformed reference frame we have, to order e 3 , 

l -n, mn P m P n + n m r m 



-n mn ( P m - c^b m )( P n - c ( b n ) + n m r m - c (P m + -cp 



\n mnP m P n + n m r m - c £ (n mP m ) + n m (c iP y n 
-~£ € \n tm (Ctb) m ] - \n m {c\b) m - n m (c iP ) m + \n m (cp) 

= mn p m p n + n m r m = , (5.32) 

where we have used the equations of motion for the background = II[6] = 

and for the perturbations II m p m = to cancel some terms. We have also used 
integration by parts, e.g. /QII m p m = W tmn p m {C^b) n + U, m (C(p ) m . We see 



then that the content of the equations in one reference frame, (5.25), is the same 



as the equation in another frame, (5.32). This completes the proof of general 



covariance of the back reaction equations. 



5.3 Background variables, averaging and covari- 
ance 

So far we have only established that the general problem of feedback of per- 
turbations is well defined, at least before any average is taken. As we showed 
in the last chapter, averages are fundamental to the problem of back reaction. 
However, it is conceivable that the average might break the covariance of the 
problem, since it might pick a preferred time-slicing. 



Equation (5.25), as it stands, involves all quadratic nonlinearities, from the 
back reaction on the homogeneous background to the second-order corrections 
to linear perturbations coming from mode-mode (kk') couplings. Although the 
subject of second order perturbations is a very interesting one (see, e.g., Ref. 
[p7|), it is not covered here. We are interested instead in how the perturbations 
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can affect the so-called zero modes (that is, the variables that describe the 
homogeneous component of the spacetime.) 

The second order perturbations can be separated into two categories: the 
zero modes r™, which do not vanish under spatial average, and the inhomoge- 
neous k-modes r™ (k comoving), which can actually be regarded as corrections 
to the perturbative modes p™. We are not interested in the modes r™, as 
we said in the last paragraph, so we concentrate on the homogeneous modes 
rjj*(t) = (r m ). We want to write down the equation for the zero modes. Clearly, 



an average performed over (5.25) will pick all the homogeneous contributions, 



and that should be regarded as the dynamical equation for the zero mode, 

(n, m r m > = n m c = ~(n mnP m P n ) = T , (5.33) 

where we have retrieved the notation that the term on the right is the effective 
(averaged) energy- momentum tensor (EEMT) of the perturbations p m . We also 
used the fact that II. m r m on the LHS is linear in r m , therefore the average of 
everything but the zero modes vanish. 

In the literature, the terms on the left of Eq. ( (5.33] ) are usually taken as 
simply the background equations, so that G|S = + rff and the EEMT 
can be interpreted as a homogeneous stress-energy contribution from the per- 
turbations. However, when solving for back reaction using these equations we 
assume that the background has already been solved and that back reaction 
will at most contribute with small deviations from this background. There- 
fore even in this simplified version the assumption is that we will eventually 
solve for the back reaction equations perturbatively, i.e. as in the format of 
Eq. ( 5.33| ). Furthermore, we have no a priori reason to believe that the back 



reaction variables r™ have the same symmetry structure and degrees of free- 
dom as the background variables 6™. The typical example would be a de Sitter 
spacetime background with an EEMT that breaks time translation invariancc. 
The perturbative format of the back reaction equations is then the most useful 
and well defined. 

Notice that gauge transformations hav e a s urprising effect on the zero modes 
r a , which can be retrieved from formula ( |5.3l| ), 

C=C _ ((£?p)m)+ l ((£ 2 6n ) (5 34) 

where the average picks the zero modes from the second and third terms in the 
formula. This is a fundamental change with respect to previous treatments of 
back reaction: the realization that quantities with respect to the homogeneous 
mode have different expressions in different coordinate systems. The r™ are in 
fact corrections to the background variables 6™ (t) , hence an accurate description 
of the background is attained only with b™{t) + (e 2 )r™(i). If the r Q 's change 
under a gauge transformation, it means that the background itself is redefined 
at second order by a gauge transformation, 
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b m (t)+C(t) ^ b m (t)+C(t) ■ (5-35) 

Even if r™ = initially, after a finite gauge transformation it is no longer zero. 

An interesting example is the case of a coordinate frame x the observers of 
which see the universe as an exactly homogeneous spacetime, p m = r m = . . . = 
0. Suppose now that we perform a gauge transformation, that is, change the 
set of basic observers at x for whom the universe is perfectly homogeneous, to 
a set of observers at 5; M = x 1 * + £ M who now see "gauged" perturbations 

p™ = p rn - (£ c b) m = -(£ c b) m , (5.36) 

and b m are the homogeneous background variables measured by the observers at 
x. Now, do these "gauged" perturbations have an effect on the (inhomogeneous) 
background in which they propagate? If the "true" spacetime does not have any 
perturbations, how could there be any back reaction at all? 

Consider the back reaction equations in each reference frame. In the original 
frame there are no back reaction equations, since the variables are all zero, 
except the background ones. In the barred frame, though, the equations are 

n m f n = - l -n mn p m p n = o =► f m ^o . (5.37) 

From this equation we find that there must be a non-vanishing back reaction 



effect in the barred frame. In fact, it is easy to see from formula (5.31) that the 
back reaction in the barred frame is given by r m — ^(£^b) m . This expression 
indeed solves exactly the back reaction equations of motion, 



n m -(£ 2 c b) m = --n mn (C c b) m (£ ( b) n . (5.38) 



That the equation above is an identity can be seen by integrating the Lie 
derivative by parts and using the equations of motion for the background, 
n m (£ c b) m = £ c Il[b} = 0. 

Of course, if we did not know that the LHS of the back reaction equations 
change under a gauge transformation, we could never make sense of the fact 
that the EEMT assumes different values on different reference frames. The 
back reaction variables f m = ^(£^b) m are in fact indispensable to guarantee 
that the perturbed spacetime seen by observers in the frame x corresponds to 
a homogeneous spacetime in the original frame x at all later times. 

It can be checked that the Einstein equations in each perturbative order are 
zero identically (i.e., they are void of dynamical content), provided that the 
referential spacetime was perfectly homogeneous. We could also ask the con- 
verse question, namely: given a set of perturbative solutions to Einstein's equa- 
tions to all orders, is it possible to find a homogeneous spacetime corresponding 
to it? Moreover, is this homogeneous spacetime unique (up to a trivial time 
reparametrization) , and what is its significance? We are not going to answer 
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these broad questions here (see Q), although by the end of this chapter the 
problems connected with the formulation of the back reaction equations will be 
clarified. 

In this spirit, let us verify now that the back reaction equations for the zero 
mode are covariant. This is not a trivial matter, as it might appear at first 
sight. We want to prove that 



n™C = --<n 



mnP P / 



n 



i'o — 2> > m n " " 



(5.39) 



with given by formula ( 5.34 ) and p m by ( 5.3C ). Substituting these expres- 
sions into the equation on the right of (5.3E) we have 



n m C + ^<n mnP m P n ) = n m r™ + i(n mnP m P n ) (5.40) 
- n m (c iP y n + l -u m (cp) m + (n ro (£ €P ) m ) - l -(u m (c\b) m ) 

where, as in Eq. ( |5.32| ), we have integrated Lie derivatives by parts and used 
the equations of motion for the background and perturbations. 

We assume now that the averaging does not introduce any extra time de- 
pendence, i.e. d/dt(f) = (df/dt). In that case the terms in the second line 
cancel, because the differential operator H tm is linear on its arguments, and 
therefore the averaging operation commutes with the action of n m over (C^p) m 
and (£|6) m . The fact that the differential operator n m has the laplacian built 
on it is irrelevant, since we are interested in the homogeneous modes whose 
spatial derivatives are zero, and the inhomogeneous parts fk with k ^ vanish 
under average. In other words, as long as the averaging does not alter the time 
dependence, the zero modes of the linear operator II im evaluated on a func- 
tion are the same as the operator evaluated on the zero modes of the function, 
(n, m / m ) - n m (/™). We have then 



n m r™ + ±(n mn p m p n ) = n m c + ±(n mnP m P n ) = o . (5.4i) 

It is easy to imagine examples of averaging procedures which do introduce 
"extra" time dependencies. Take, for example, an average over the domain 
V(t) between the spherical shells at comoving radii R uv {t) = e~ m H- x and 
Rib. = Ri constant. (See Fig. 4.2). The small radius Rjjv(t) corresponds to the 
Hubble horizon of a de Sitter spacetime in comoving coordinates R = r/a(t), 
and the large radius Rir is the comoving radius of some initial bubble, as 
defined in the last chapter. This average corresponds to a set of perturbations 
whose spectrum has both ultraviolet (UV) and infrared (IR) comoving cut-offs 
kuv(t) — R^jyit) and km = RJr, so we write 
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Figure 5.2: Perturbations generated during an inflationary epoch exit the Hubble 
horizon i7 _1 /a(i) at all times. The number of comoving momenta included in the 
spectrum of the "classical" perturbations are therefore increasing with time, for ex- 
ample kuv oc e Ht during inflation. 



r{t) 



kiR 



dk 
T 



(5.42) 



where Sf(t) is the power spectrum of the perturbation (j>k(t) (see Section 3.3). 
Clearly, the averaging does not commute with a time derivative, because the 
upper limit of the integral is time dependent. Of course, this time dependence 
can be very small in some cases, so that in perturbation theory we can ignore 
it. This is indeed the situation that we will encounter when we analyse the back 
reaction of scalar perturbations in the chaotic inflation scenario (see Chapter 
7.) 

However, let us argue that an averaging procedure which introduces a non- 
negligible time dependence is not acceptable for reasons other than the fact 
that it does not yield a covariant back reaction problem. The reasoning is best 
followed in momentum space. Take the expression ( ^42| ), in which the UV 
cut-off is time dependent. The integration sums up the contributions from all 
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fc-modes, from the longest comoving wavelength kj^ to the shortest wavelength 
k^jy . The fact that the UV cut-off is time dependent means, though, that the 
average at a time t + At will include more modes than the average at time t. In 
other words, at later times more (short-wavelength) modes have been "created" 
in the calculation. Where have they come from? 

In inflation models, these modes have just exited the horizon, and from that 
point on we regard them as classical perturbations, given that the quantum fluc- 
tuations for those scales have a very high occupancy number. For the purposes 
of the classical calculations, what that means is that at every moment t during 
inflation there are modes being created, with comoving wavelengths with the 
size of the horizon scale, H~ 1 (t)/a(t) (see Fig. 4.2). Hence, there is effectively 
a source added to the stress-energy tensor that creates the small-scale perturba- 
tions, and this source is not conserved in the same sense that the stress-energy 
tensor is conserved. In the case of inflation, we will show later that the creation 
rate is very small and can be neglected in the conservation equations. 

If this source is too strong and effectively destroys the conservation equa- 
tions, some more fundamental problems appear. Mainly, if the stress-energy 
tensor is not conserved anymore, as a consequence the Bianchi identities are not 
satisfied and it is not even clear if Einstein's equations can be trusted anymore. 
Clearly, we would like to avoid such a radical departure from standard physical 
practice, so we concentrate on situations where the conservation equations are 
obeyed. 

Summarizing, we found that the second order equations for the zero modes 
(the so called back reaction of perturbations on the background) can be for- 
mulated in a covariant manner. This was done with the help of the theory of 
finite gauge transformations, and a crucial ingredient is that the zero mode vari- 
ables r™ are rescaled by a gauge transformation. We found that the long-held 
assumption that the background is unchanged by a gauge transformation does 
not hold to second order. 

In this context, the paradox that the EEMT is changed under a gauge trans- 
formation could be resolved: if on the one hand the EEMT changes, on the other 
hand the zero modes that describe back reaction also change, in just the right 
way such that the problem of back reaction is well defined. 

5.4 Gauge invariant variables to higher orders 

We now want to go beyond this interesting but standard covariant construction, 
and try to build variables that can describe back reaction in a gauge invariant 
way. We saw in Section 2.5 that it is possible to build gauge invariant variables 
that describe perturbations, 

pm = p ™ + (£ xb yn ; (543) 

where the vector X was suitably defined in terms of its transformation laws, 
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x» x» = x^ + c 



(5.44) 



We found earlier that we can express X in terms of the metric perturbations 
in a generic gauge, and that it is also a "small" vector cx e, in the same sense 
that £ and the perturbations are small. Moreover, by fixing the gauge such that 
X = by taking B = E = 0, the gauge invariant variables P rn would take the 
same value as the variables in longitudinal gauge, pf 1 . 

This procedure can be extended to second and higher orders, now that we 
know the gauge transformation laws to arbitrary orders. Inspired by the results 
in first order in the perturbations, consider the following candidates for gauge 
invariant variables: 



QT = {e Lz q) m , (5.45) 

where the Lie differential operator is formally defined as before, but evaluated 
with £ = —Z. In a transformed coordinate frame x, Q m would be defined by 
Q = e c zq. 

We want the Q's to be gauge invariant, that is, their expressions must be 
equal in both reference frames, Q m = Q m . Using the ansatz above we have 

Q = e c zq = e c z e - c tq = e Cz q = Q . (5.46) 

The Lie derivatives cannot be summed in the exponential because, due to their 
different arguments (Z and £), they do not commute. We can use the Campbell- 
Baker-Hausdorff formula for the exponential of an operator and get the condi- 
tions on the vector Z, 

Z^ = Z^ + ^ + hz,^ + 0{e 3 ) , (5.47) 

where [Z,iY = Z^ v - Z v ^ v is the commutator of the vectors Z and £. To 
derive this formula, it is useful to note that the Lie derivative of a contravariant 
vector is given by the commutator of £ with that vector, (C^A)^ = [A, 



If we ignore the third term on the RHS of (5.47) we retrieve the definition 
of X used in the construction of the gauge invariant variables to first order (see 
Section 2.5). The commutator gives a correction term of order e 2 , which must 
be incorporated into the definition of Z if we want to define gauge invariant 
variables to second order. Higher order terms like that must also be included 
as we proceed in perturbation theory. To second order we have, perturbatively, 
Z = eX + e 2 Y + 0(e 3 ) and their transformation laws should be 



X[p] 



y[f,p 21 



(5.48) 
(5.49) 
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where by "K[r,p 2 ] we mean that the vector Y is in general a linear function of r 
and p 2 , by the same token as X is a linear function of the p's. 

The gauge invariant variables for perturbations and background to second 
order are defined then by the formulas 



iC = rT + {(C xP ) m ) + {(CYb) m ) + l{(C 2 x b) m ) . (5.51) 

We still should determine what the second-order vector Y^ is. The equations 
that determine its expression, though, are very complicated, and there may 
not even be an answer that is local on the variables r m and p m . It would be 
important to find an explicit expression for Y , but we may be able to set it to 
zero. 

The argument is the following. Consider the relationship between gauge 
invariant linear perturbations and the perturbation variables of a fixed gauge. 
To linear order, we saw that there is an infinite number of gauge invariant 
variables, corresponding to the infinite number of ways of choosing the vector 
X. Each set of gauge invariant variables is equal to the perturbation variables in 
one specific gauge, for example in longitudinal gauge 4>i = $^ and tpi = <]/( SI ), 
where the variables in the right-hand-sides are the gauge invariant variables 
found through the use of the vector Xj* = [B t - E' u -W i E{\ = 0. The only 
requirement is that the constraints on the metric (Bi = Ei = for longitudinal 
gauge) remove all the gauge freedom and force the shift vector £ = 0. As long 
as a gauge is completely fixed, there are gauge invariant variables corresponding 
to the perturbations in that gauge. 

With respect to the background variables r a the same rationale is applicable: 
the variables in any well-defined gauge are equal to some set of gauge invariant 
variables. If the constraints on the metric perturbations demand that £ is zero 
to first order, all we need are second order conditions on the r m that fix £ to 
an accuracy of order e 2 . In other words, we must impose constraints on the 
second-order quantities r m such that no variations of £ of order e 2 are allowed. 
Let us split the shift vector in the form £ M = e£ M + e 2 a^, so that the first-order 
constraints should fix £ M = and the second-order constraints should fix = 0. 

Let us assume that the first order constraints have been imposed, so we only 
have to worry about the second-order ones. The background variables r™(t) 
change, under a gauge transformation with the shift vector c^, as 



'(f) = r™(t) - (£„&)' 



(5.52) 



eh uv (x, T))- 



Call the second-order metric perturb ations j M „, that is, g^ v — 7^1/(17) 1 
e jVi/(f) where 7^ is given in Eq. (2.19) and the perturbations are the usual 
ones (of the scalar, tensor and vector kinds). The most general form of j that 
we can consider is, 
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*— 'HSm 1V.J • <"») 

in which case the gauge transformation laws with a give 

2(i) = z (t) - i[ a( r ]' , (5.54) 

«*(*) = WiW-ofi + o-i' , (5.55) 

w(t)8 l j = w(t)Sij + —a°5ij + ■ (5.56) 

If we demand that both sides of the equations are homogeneous functions, from 

( |5.54 ) we immediately see that a must be a function only of time. Using this 



in Eq. (5.55) we deduce that a 1 should also be a homogeneous function, (Jj(t). 
Inserting this into Eq. ( 5.56| ) we see that the gradient terms <Tjj drop out, and 



the gauge transformations are finally 

1 



z{t) = z(t)--[aa°}' , (5.57) 



a 



Vi(t) = Vi{t)+ai' , (5.58) 



w(t) = w(t) + —a . (5.59) 
a 

We can now fix the gauge to second order, for example by demanding that 
w = Vi =0. These constraints fix the homogeneous functions cr°(t) to zero and 
<7j(i) to an irrelevant time-independent constant. 

The conclusion is, as long as we fix the gauge in first and second orders, the 
problem of back reaction will be well defined, since there are gauge invariant 
variables corresponding to the perturbations p m (x, t) and r™(t). In addition, we 
know exactly how these quantities transform when we go to a different gauge, 
hence we can compare the dynamics of back reaction in one gauge with the 
dynamics in a different gauge. We have shown that the results are identical 
when it is gauge invariant quantities the ones which are compared. 

From now on we only use gauge invariant quantities in the treatment of the 
problem of back reaction. We have then, 

n m iC = -\{n mn p m p n ) , (5.60) 



where the variables R m were defined in Eq. (5.51) and P m in (5.50). The RHS 



of this equation is the gauge invariant effective energy-momentum tensor. 
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Chapter 6 

The Back Reaction of 
Gravitational Waves 



As a first application of the ideas developed in the previous chapters, let us 
focus on a simple example: tensor perturbations in cosmology, or, gravity waves 
in FRW spacetimes. We will treat the case of scalar perturbations in Chapter 
7. 

We can treat gravity waves and their back reaction independently of the 
effect from scalar perturbations because the physical processes that give rise to 
gravity waves and to scalar perturbations are disconnected: while scalar per- 
turbations couple to energy density and pressure perturbations, gravity waves 
evolve in a completely independent way from the matter distribution. Therefore, 
the two types of perturbations are not correlated, that is, the average (<ph) = 0, 
where <f> is the newtonian potential and h is the strength of the gravitational 
waves. Therefore we have two EEMT's, one for gravit ation al waves and one 
for scalar perturbations. The back reaction equations ( 5.6C| ) are linear on the 
EEMT's, therefore the total back reaction R™ is just the sum of the partial 
back reactions from the tensor modes, i?™ ^ and from scalar perturbations, 



R'o 



(S) 



6.1 Energy-momentum tensor for gravitational 

waves 



The metric for tensor perturbations was given by Eq. (2.22), 

ds 2 = dt 2 - a 2 (t)(S lk + h ik )dx l dx k , (6.1) 

where obeys the constraints h\ — h l j i = 0. It was shown in Section 2.4 
that these quantities are unchanged under gauge transformations, that is, the 
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variables hij are automatically gauge invariant. 

The starting point to derive the EEMT of gravity waves is the second vari- 
ation of the Ricci tensor [see, e.g., B], 



-R^,mn6g m 5g n 



+ fig™ 13 {^9af3\iiu + Sg^ap ~ Sg at i\vl3 ~ ^9au\tifi) 

+ 5g^{5g a ^-5g Ma ) (6.2) 



where | denotes covariant derivatives with respect to the background metric. 

It is also useful to recall the equations of motion for the gravity waves in 
normal time, 



3Hh l:j - = 



(6.3) 



which can be used to simplify the expressions for the Ricci and Einstein's tensor 
containing gravity waves. After performing the spatial averaging, we get 



T oo — -(hki hu) + - ( (hklhki) H — ^{hu,mhta,m) 
a 8 V a 



(6.4) 



and 



3.- ■ 

-r:\hkihki) 



+ -a 2 (hikfikj) + -^(hki,ihki,j) — -(hik,ihjk,i) 



(6.5) 



The first expression can be interpreted as the effective energy density of gravi- 
tational waves, 



= r° . (6.6) 

We note that, only in the case of the EMT of gravity waves, r^, = 7 a " t t (TI , where 
7 M „ is the background metric (the relationship is more complicated when the 
EMT of the matter perturbations is nonzero - see Chapter 7.) This result also 
coincides with the result obtained through Landau's pseudotensor of energy and 
momentum Q in the gauge specified in formula ( 2.22Q . 

The relation between Ty and the quantity which we could naturally interpret 
as an effective pressure is not so straightforward as it looks at first glance. The 
problem is that the energy momentum tensor for the gravity waves r^, is not 
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conserved itself, that is, t^, ^ 0. This is not surprising since the gravitational 
perturbations "interact" with the background and only the total EMT must be 
conserved. Consider the exact Bianchi identities for the metric perturbations 

G$ ilt = d, (ot [7] + G^ m P m 7 +Gt >m R m 7 \G% mn P m P n + ..)) (6.7) 

(1%, [7] + r a ^ m P m + r^ m R m + l -T^ mn p m p n + . . .) x 

( G M H + G' 1 P m 7 G' 1 R rn 7 -G* 1 p m p n + 

— 1 r^ 1 w 4- p m 4- r^ 1 R m 4- ir^ p m P n 4- 1 x 

+ Gl, m P™ 7 G«, m iT> 7 i^, ron P-P« 7 . . .) , 

where we have expanded the connections T and the Einstein tensor G up to 
second order in the perturbations P and R. 

These equations are true in each perturbative order, as it should since the 
Bianchi identities are geometric identities. We can greatly simplify these equa- 
tions by decoupling the several classes of terms from each other. All zero and 
first order terms drop out since they are satisfied independently of the next 
order equations. Moreover, all second order terms which are linear in R m also 
drop out, since they are zero by themselves^]. 

We are left with the following equations for the EEMT t% = \G% mn P m P n 



^((G^) mn P m P n )=d^ (6.8) 

^(r^, m „P m P"> x G»M - (r^ m p™ x G^ m p m ) - r^[ 7 ] x ^ 

+ \{Ku, mn P m P n ) x G« [7] + (T^ m P m x G^ m P m ) + KM x r« . 

The terms involving r are what we would naively expect from the conservation 
equations for the EMT of perturbations, t . The remaining terms in the 
second and third lines are corrections stemming from the interactions of the 
perturbations with themselves and with the background. 

1 R m are independent degrees of freedom. Another way of seeing that this is true is ob- 
serving that the Bianchi identities for R m are given by (G^, [b]) t mR m ■ We could also regard 

v I fl 

R m as linear perturbations in the same sense that h,j are linear perturbations, in which case 
the Bianchi identities are automatically satisfied. 
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In the case when there are no matter perturbations, the terms on G, m P m are 
proportional to the equations of motion and therefore vanish by themselves, so 
we find that the only correction comes from the terms involving second variations 
of the connections. We have, finally, that in a FRW background metric the 
stress-energy tensor of gravity waves obeys the equations 

p gw + ZH{ Pgw + n gw ) + \(K^ mn P m P n ) ( Po + Po ) = , (6.9) 

where Tr gw = — t$(t\), and p a and p are the energy density and pressure of 
the homogeneous background matter. Usually, the continuity equations for 
hydrodynamic fluids in FRW spacetimes are of the form p + 3H(p + p) = 0. 
Here, instead, due to interactions between perturbations and background, there 
are additional terms corresponding to an effective pressure. We can write then 

p giv + 3H(p gw +p gw ) = , (6.10) 

where 

Pgw = TTgw ~ — (A y 7l«)(/) + p Q ) (6-H) 
OH 

can be interpreted as the pressure of gravitational waves |53|. The second term 



in (6.11) does not contribute to the pressure only in a de Sitter Universe in 
which p — —p . In this case the term —\t\ itself can be interpreted as a 
pressure. 

6.2 Gravitational waves: short wavelength limit 

We will now study the back reaction and the equation of state for gravity waves 
in the limit when they have short wavelengths and high frequencies (HF) in 
comparison with the curvature radius |o/d|. To calculate the EEMT in some 
given background we need 1) the mode solutions h l £{t) and 2) the spectrum 

5 | ^ ^ ' * ' ' ' ' ^ ' ' ' ' ^ ^ ' ^ ' ' ' ,: ' ' ^ ' ' ' >-"!'■■- hii^ ii slit li-l. • ■, [ i i . \ I'll 1 i ; t 



|<5^| 2 that determines the rms amplitude of each mode with wavenumber k. 



The solutions to the equations of motion (3.S) in the limit k ^ (<V a ) 



are simple plane waves [see, e.g., Eq. (2.63)], since short-wavelength waves 
propagate without being disturbed by the curvature. The frequency of course, 
depends on the physical wavenumber k p h = k/a(t) so we have 

hij {x,t) = ^=J -^k-'/^ls^Me'^^+o.c] . (6.12) 

If we assume that there is an ensemble of gravitational waves which are 
isotropically distributed, their average value is given by a sum over all the 
modes in all directions. If, in addition, we average over a time period T such 
that fc _1 «T« -ff -1 ) the rapidly oscillating terms vanish and we have 
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I ^ l^r^. (6.13) 

where <C • • • 3> denotes space and time average. 

A single plane gravitational wave propagating in the z direction, i.e. k z = 
(0,0, k z ), has a polarization tensor which is traceless and transverse to the z 
direction, hence the only nonzero elements of the tensor are en = —£22 and 
£12 = £ 2i- The strength of the gravity wave is given, in this case, by 

«h^(z)h* J (z)»=2\A\ 2 (\e 11 \ 2 + \e 12 \ 2 ) , (6.14) 

where we have used that 5 h (k z ) = Ak\ S(k — k z ), and A is the adimensional 
constant that sets the amplitude of the wave. The polarization tensor is usually 
normalized as eye*^ = 1, so that en = £12 = 1/V2- In terms of spherical 
polarizations, we have e± = (1 ± i)/V2. 

We can now evaluate the effective energy density and pressure of HF grav- 
itational waves. Notice that the effect of a time derivative acting on a wave 



mode hij is to bring down a factor of k/a(t). The terms in (6.4) and (6.5) 
containing a/a are negligible, since k ^> H~ l . Likewise, the correction terms to 
the pressure in Q6.ll ) can be neglected. After collecting the diagonal elements 



we conclude that, for short wavelength gravity waves, the energy and pressure 
are given by 



1 _ 1 

i Pgw ~ 12a 

In terms of the physical momenta, the integral can be cast in the form 



Pgw = ^Pgw = 77772 < hu,mhkl,m > ■ (6.15) 



Pgw - -J ^kl h \5 h k \ 2 , (6.16) 



7T_ f dk p h j 2 i ,k|2 

J kph ^ 

where we have used isotropy to transform the integral over all space into an 
integral over the norm k = \\k\\. 

The result above is the usual one[§, and expresses the fact that gravitational 
waves in the geometrical optics limit (short wavelength and high frequency) be- 
have essentially like radiation, independently of the evolution of the background. 
Their equation of state is p gw — p gw /3, which is consistent with the time de- 
pendence p gw oc a~ 4 (t). Notice that this time dependence follows from the 



continuity equations (6.1C), by neglecting the correction terms which are small 
in the context of HF gravity waves. 

In the case when the power spectrum is scale invariant, \5%\ = M, the 
integral over physical momenta has a quadratic divergence on the UV, a fa- 
miliar feature from quantum gravity where divergences are usually of the sort 
J dkk 271 ^ 1 for n loops. In Einstein's theory there is no consistent way of reg- 
ularizing these graphs, so the theory is termed non-renormalizable. That the 
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integral (3.16) is divergent is a symptom of this fact, since the space average 
that we use corresponds to a 2-point correlation function, or a 1 loop graph. 

If the spectrum \6%\ has time-dependent cut-offs the integral ( 6.1 6| ) has an 
additional time dependence coming from the limits of integration. As we dis- 
cussed in Chapter 5, though, on the grounds of the conservation equations we 
do not expect this additional time variation to be of much consequence. In the 
case of HF gravity waves the physical IR cut-off could be set as the scale Hit) 
of the background, and the UV scale could be some fixed microscopic scale that 
depends on the cross-sections of the graviton with the other sectors of matter. 
Since the Hubble parameter is at most constant, and usually decays as a power 
law of time, the energy density of HF gravitational waves is dominated by the 
UV, thus a time dependence in the IR is usually irrelevant. 



6.3 Gravitational waves: long wavelength limit 



The solutions to the equations of motion for the gravitational waves, Eq. fl6.3[ ), 
now depends on the background evolution. The back reaction effects of long 
wavelength gravity waves are then substantially different than the short wave- 
length oncsj53|. We will treat here the cases when the background is dominated 
by radiation (p = p/3) and vacuum energy (p = —p): 



o(t) = 



1/2 



Radiation 
DeSitter. 



(6.17) 



Since we only need the solutions in the limit where k/aH <C 1 we can use the 
trial solutions 



ha(t) = k~ 3/2 5%t 



k 

aH 



(6.18) 



and solve for the factor a in lowest order in k by inserting into the equations of 
motion for hij. For de Sitter a = —1/6 while for radiation a = 1/2. 

In the de Sitter spacetime we obtain, after substituting the solutions above 
into Eqs. JS.4| ) and (6.5) and taking into account that p a = —p Q we have that, 
to lowest order in k 



and the pressure is 



Pgw 



7?r 
2^2 



dk 



h\2 



(6.19) 



Pgw 



(6.20) 



For a scale-invariant spectrum the integral is logarithmic, therefore in terms 
of physical momenta the time dependence of the energy density and pressure 
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is still a 2 . Notice that this time dependence is in perfect agreement with the 



equation of state (6.20) and the continuity equation. 

The fact that the energy density in long wavelength gravitational waves is 
negative has been known for some time in connection with proposals that the 
cosmological constant should decay by Hawking radiation However, unless 
the spectrum of the waves is severely scale-dependent (not a very persuasive 
suggestion in de Sitter spacetime), the influence of LF gravity waves should 
decay as a~ 2 . The problem with the proposal is that, once outside the Hubble 
horizon, the energy density of the gravity waves decays fastly with time, al- 
though their amplitude remains more or less constant. In Einstein's theory the 
— component of the Einstein tensor does not contain terms like H 2 (h l ' J 'hij) , 
which would contribute a non-decaying negative contribution to the energy den- 
sity. In quantum gravity with a cosmological constant in the far infrared, loop 
corrections yield just that sort of term, and it has been suggested |5Sf that in 
this case the back reaction from gravitational perturbations will lead to a decay 
of the cosmological constant. A central element of this mechanism is the over- 
population of the low momentum sector of the spectrum, which we described in 
Fig. 4.2. 

In a radiation-dominated universe the situation is a bit more complicated, 



since this time the corrections to the pressure (6.11) must be taken into account. 
Substituting the solution for the gravity waves into the equation for the energy 
density we have that 

and the effective pressure is 

P aw = ~ P -f ■ (6.22) 



The bare pressure is actually 7r glu = — jgp gw , but the second term in ( 6.11 ) 



contributes with Ap — y| p gw . 

Because of the approximation we have done (neglecting terms which are of 
higher order in the momenta) now the divergence is logarithmic, instead of the 
quadratic divergence that happened previously in the case of hard gravitons. 
This integral still necessitates a UV cut-off, but its origin is now physically 
sensible: the UV limit is the comoving horizon scale H~ 1 a~ 1 . The nature of 
the IR cut-off is the same as we encountered before in the treatment of quantum 
fluctuations inside an inflating bubble, and is related to the largest scale over 
which we can consider our background to provide accurate information about 
our universe. Any time dependence on the cut-offs will enter in the expression 
for the energy density of gravity waves only as a logarithmic factor, therefore 
deviations from the a~ 2 behavior can be neglected as small corrections. 

The results above show that the energy density and pressure fall with a~ 2 for 
a scale-invariant spectrum, as in the previous case. This is of course consistent 
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with the effective equation of state p gw — —p gw /3. The energy density in long 
wavelength gravity waves is again negative, but this time it is decaying less 
fast than the background energy density p m oc a~ 4 , which means that the back 
reaction from LF modes could become important. 
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Chapter 7 



Back Reaction in the 
Chaotic Inflation Model 

We now want to focus on the case of scalar perturbations in the early universe. 
We consider as a background model the chaotic inflationary scenario @, of Chap- 
ter 3, where the inflaton potential is V — m 2 (p 2 /2. We derive the equations in 
longitudinal gauge, where the gauge invariant variables defined in Section 2.5 
coincide with the variables <fi and ip. Furthermore, since the perturbed energy- 
momentum tensor of the scalar field is diagonal, ip = <fi. As we have argued 
in Chapter 5, the analysis is independent of the coordinate frame insofar as we 
consider only gauge invariant variables in predictions of physical quantities. 

The fundamental scale of the chaotic background model is the self-reproducing 
scaleQ, 

<p ar = m- 1 ' 2 . (7.1) 

In regions where the scalar field is above this scale the universe is foregoing 
a process of continuous creation of new inflationary "bubbles" fueled by large 
quantum fluctuations of the scalar field. On the other hand, regions that see 
the scalar field drop below the self-reproducing scale behave as an independent 
bubble that evolves following the classical equations of motion. We will assume 
that the region over which the FRW description is approximately valid is this 
"classical" bubble of physical radius given by H~ 1 (t sr )a(t)/a(t sr ). 

We will perform the analysis using covariant components, and it is a straight- 
forward matter to show that the final results are equivalent in any possible 
components one happens to choose. 

1 We stress once again that our units are such that 8irG = M 2 = 1, therefore in Planck 
mass units tp 3r = M p i sj M p i /m 3> M p i . 
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7.1 Effective energy-momentum tensor of scalar 
perturbations in Chaotic Inflation 



We use again the expression ( |6.2| ) for the second variation of the Ricci tensor, 
where the metric now is given by 



ds z 



(1 + 2(/))dt 2 - a 2 (t)[(l - 2(f)dx 2 



(7.2) 



where x are the comoving coordinates. The energy-momentum tensor of the 
scalar field matter is 



T, 



<p' a <p. a - V(ip) 



(7.3) 



With these inputs, the expressions for the EMT of perturbations are 



-„„ = L-lti\<M>) -3((0) 2 ) +9a- 2 ((V0) 2 ) 

((^) 2 ) + ^- 2 ((vM 2 ) 



12H 

1 

2 
1 



V"{<p ){8<f?) + 2V'(<p ){<t>6<p) 



(7.4) 



and 



a 8i 



(2AH 2 



16H)(0 2 ) 
4 



2AH 



+ {(4>f) + 4(00) - - a - 2 ((V0) 2 ) + 4^3 2 (. 
+ i((^) 2 )-ia- 2 <(VM 2 )-^oW) 



1 



V»(<p ){8<fr l ) + 2V'(<p ){<j>S<p) 



(7.5) 



where H is the expansion rate. 

First we recover some familiar results for short wavelength scalar perturba- 
tions. In the limit k 3> aH , both <p and Sip oscillate with a frequency oc k. In 
this case [see (7.9)]: 



1 ia, . c 



(7.6) 



Hence, it follows by inspection that all terms containing <j) in (7.4) and ( |7.q ) are 
suppressed by powers of Ha/k compared to the terms without dependence on 
<t>, and therefore 



93 



roo ^ - < m 2 > +-a- 2 < (VM 2 > +- V»(<p ) < 6<p 2 > (7.7) 

and 



Tij = a 2 S tJ ji < (Sip) 2 > -V 2 < (V6<p) 2 > -\ V"(tp ) < 6<p 2 > j . 

(7.8) 

It can be checked that HF scalar perturbations have the equation of state of an 
ultra-relativistic gas (radiation), p = p/3, and the energy density in a fixed set 
of modes decays accordingly, p oc a~ 4 . 

We now focus on the more interesting long wavelength, low frequency (LF) 
perturbations. First, some general remarks about the problem of back reaction 
of scalar perturbations in a model with a general potential V(ip). We have shown 



in Section 2.6 [see formula (2.67)] that the spectra of 5(p and are related by 
the constraint 

4> + H<l>=^(p 8(p . (7.9) 



During inflation, <p is proportional to a slow-roll parameter and can be neglected. 

v' 

' 3H> 



We have then, using that ip ~ — 4tt , that the spectra of LF scalar perturbations 



are constrained by 



2V 

s^pk = —yf 4>k = -<fo4>k ■ (7-io) 



When considering the contributions of long wavelength fluctuations to r y 



fit/ 1 



we can neglect all terms in ( [7.4[ ) and ( |7.5[ ) containing gradients and 4> factors. 
Because of the "slow rolling" condition, the terms proportional to ip 2 and H 
are also negligible during inflation (but they become important at the end of 
inflation). Hence, in this approximation 



r 00 ~2 V " < 5{ P 2 > +2V ' < ^ > ( 7 - U ) 



and 



a 2 h 3 I —p;H 2 < cj) 2 > -I V" < Sip 2 > +2V' < <p5<p > \ . (7.12) 
(^ 7rG 2 J 



Making use of (7.10), this yields 



V"V 2 



(7.13) 



94 



and 

p s ^-y=- P w . (7.i4) 

Thus, we have shown that the long wavelength perturbations in an infla- 
tionary Universe have the same equation of state p s — —p s as the background. 
When the scalar field tp is slowly rolling down its potential, the energy density 
of the EEMT of perturbations is negative, which means that back reaction lower 
the effective potential energy density of the background. 

It might be objected that it is not consistent to throw away terms in the 
background energy density like (fi 2 /2, and still consider the back reaction terms 
above. However, the background terms that have been discarded do not change 
appreciably during inflation, whereas we will show below that the contributions 
from back reaction are a growing function of time and, as inflation proceeds, 
their strength increase by many orders of magnitude. 



7.2 The back reaction of long wavelength modes 



We will use the spectrum of perturbations created by quantum fluctuations of 
the scalar field to calculate the intensity and the time dependence of the back 
reaction terms in the model V = m 2 ip 2 /2. We will calculate, in addition to what 
was done in the last section, next-to-leading order terms in the energy density 
and pressure of the EEMT of perturbations, using the slow-roll approximation 



H / H 2 -c 1. From Eq. (3.68) we take that, during inflation, the power spectrum 
of the field <f> is given by 



1 



log 



(7.15) 



where <p (tf) is the time when inflation ends. To arrive at this formula we made 
use of the solution to the H ubble const ant and scale factor during slow-roll given 
respectively by Eqs. ( 3.21 ) and ( 3.25 ). 

We can use the spectrum of <f> to derive the spectra of every term present in 
Eqs. (7.4) and ( [7.5[ ). For example, from the definition of the spectrum we have 
that 



d 
dt 



(7.16) 



The average value of the fields (<j>8tp) over the comoving volume between 
the Hubble horizon at iJ _1 (i)/a(i) and the "particle horizon" of the inflating 
bubble at H~ l (U) j 'a(U) is given by the integral 



(#?)t 



fc W dk 

hi & 



(7.17) 
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where ki = H(ti)a(U) and k{t) = H(t)a(t) are the comoving momentum cut- 
offs. Notice that the negative sign is due to the fact that the constraint (7.10) 
implies that the spectra of <f> and dtp are anti-correlated. There is a sensible 
physical reason for that: the newtonian potential is lowest where the mass 
density is highest, and vice-versa. The interaction term (4>Stp) thus expresses 
an interaction energy, which lower the total energy of the system and therefore 
has a negative sign. 

We could proceed and calculate all the terms of the energy d ensity and 
pressure of the perturbations that improve upon the expressions (7.13) and 
( 7.14 ), but the re is a more elegant way of doing this calculation. We can use 
the constraint (7.10) linking <f> and dtp to express the whole energy-momentum 
of perturbations in terms of the function <f> 2 times some time-dependent tensor. 
After that, we can make the averaging by simply calculating (4> 2 ). 

Fro m Eq . ( 2.80 ) we can deduce, after using the expressions for the scale 
factor ( |3.25 ) and the approximation H = —tp tp /2 (valid for the potential 



m tp 2 /2 while inflation lasts), that 



At 



(7.18) 



and, due to ( [7.10 ) 



dtpk 



-At 



tfio{t) 



-tp (t)4> k 



(7.19) 



From these expressions it is easy to see that, e.g. 



^ = _ 2 ^ and p.= 

4>k tPo Otpk 

If we use these constraints into expressions 



<fo_ 

tfo 



(7.20) 



J7.4D and (7.5) for the energy 
density and pressure, and neglect terms of order (1/tp 2 ,) < 1 we get after some 
algebra that 



rA 1 - 



-3p 






dU3 Po -8tp 2 ) 



(7.21) 



where p a and p are the background energy density and pressure, respectively. 
The term 8tp 2 is a small correction, but we keep it because we want to discuss 
the continuity equations that t£ satisfies. The fact that the energy density 
is negative implies that the biggest contribution to the energy density of long 
wavelength scalar modes comes not from their self-energy, but from their in- 
teraction energies. The main contribution to the formula (7.4) for the energy 
density comes from the term 2V'(cf>Sp), which has a negative sign due to the 
anti-correlation between the two variables [see the constraint (7.1C ).] But for 
the correction term above, we would have the approximate EEMT of scalar 
perturbations given by Eqs. (7.13) and (7.14), or 
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r/,' = -3<0 2 )T^ O ] . (7.22) 

This shows that the tendency of back reaction is to counteract the effective 
vacuum energy from the scalar field that drives inflation. The crucial point 
is, does this effect grow in time, or does it amount to a renormalization of the 
energy of the scalar field? If it grows, is there a time when back reaction becomes 
so strong that it ends the period of inflation? 

Clearly, to answer these and other questions we must finally calculate the 
corre lation (4> 2 )- This is straightforward, with the sp ectru m given in formula 
( [7. 15| ) and the solution for the scale factor, Eq. ( 3.25 ). Assuming that we 



calculate the average of 4> 2 at a time when ff <C foif) f%, where <PiU) 
V 5 o(^i(/)), we have that 



k ^ dk 
k 
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dx(l - xf 



(7.23) 



^log(^), using expression ( 3.25 ) for 



after changing variables to i 
the scale factor and neglecting small logarithmic corrections of the order of 



log 



H(tf 



in the lower limit of the integral. We have then, approximately, 

3 



(i 



288tt 



2 r 



¥>§(*) 



288tt 2 ipi(t) 



(7.24) 



that is, ((j> 2 )t is growing with time as <p~ A and consequently the influence of 
back reaction becomes more important the longer inflation takes. The EEMT 
is proportional to <Pi/<f%, where <pi is the value of the scalar field when inflation 
has started in the bubble that we are considering. The bigger ipi is in a Hubble 
size patch, the larger that patch will be at the end of inflation, and the more 
important back reaction will become. 

If we consider that inflation ends when ip <~ iff <~ I, then by demanding 
that back reaction never becomes important in our patch of the universe we 
constrain the initial value of the scalar field in that patch to be 



Po(tf) « \Ps(tf)\ 



<Pi <C m 



-1/3 



(7.25) 



If inflation started with the scalar field above this scale, then back reaction 
would have been important for the dynamics of the universe. Notice that the 
scale m -1 / 3 is much smaller than the self-reproducing scale ip sr = mT 1 / 2 . 

We can also estimate the scale above which back reaction is important at 
any time: if ip > mT 1 the energy in the EEMT of perturbations is of the same 
size as the energy density of the background. Of course, this bound is of limited 
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value since it is applied to a region where we expect quantum gravity to be valid 
(mV>l[M p 4 J.) 

It is much more interesting to estimate the time when back reaction would 
become important, given a certain initial value of the scalar field ipi. From 



(7.22) and (7.23) we find that the value iptr of the scalar field at which the 
energy density in the EEMT becomes of the order of the energy density of the 
background is 

p br = to i7 V- /2 ■ (7-26) 

If we suppose, as suggested by global structure of the Chaotic Inflationary sce- 
nario, that the initial value of the scalar field is ip ST = m -1 / 2 , we get that back 
reaction became important when the value of the scalar field reached 

^ =m 1 / 2 (m- 1 /2 ) 3/2 = m -i/4 _ (72?) 

It is natural to suppose that when the scalar field reaches this value inflation 
should end, since the negative energy density of perturbations is of the same 
size of the energy density of the background. In that case it is valid to say that 
the effective energy density of the background has relaxed to nearly zero. 

Another estimate can be made for the strength of the EEMT of perturbations 
if we consider that the size of the visible universe today corresponds to an entire 
inflating bubble. Since this region inflated at least 50 e-folds, we have from the 



formula for the scale factor (3.25) that p 2 — ip 2 ~ 200. The relative size of the 
terms of the EEMT at the end of inflation is then, with m ~ 10 -6 , given by 

{4> 2 ) ~ io- 8 . 

We can thus summarize some of the interesting features of the back reaction 
of LF scalar perturbations: 

• The energy density is negative; 

• \p s \ is growing in time; 

• The equation of state is approximately that of a negative cosmo- 
logical constant term, p s ~ — p s . 



7.3 The continuity equation and the Klein-Gordon 
equation 

There are two loose knots that we should secure before resting our case. First, 
the fact that the energy density of the EEMT is growing faster than the energy 
density of the background is not consistent with the equation of state p s ~ — p s 
of Eq. ( 7.22| ). The corrections that we derived in the last chapter [see Eq. 



( |6.11 )] are very small and cannot account for this discrepancy. And second, 
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what is the role of the Klein-Gordon equations, and are they consistent with 
the picture given by the EEMT that we derived? 

First we compute the continuity equation for the EEMT. Starting from the 



expression (7.21) and using the relation (7.2C) we find that 



nVo 2 2/ 

Ps - 3 — m <p { 
and thus we find that, by using H ~ 



l + 0(— ) 



(7.28) 

- i Po i Po/2, the continuity equations are 



Ps + 3H(p s + Ps ) = <p (-2m 2 ip ) (4> 2 ) , (7.29) 

to lowest order in slow-roll parameters^. This equation would imply that the 
EEMT is not conserved separately because of the interaction with the back- 
ground. However, let us consider the case of the Klein-Gordon equation, which 
can be obtained through the 4-divergence of the total energy-momentum of the 
scalar field: 



(T$ ;ll ) = tPoiUh+Sg&o + S<p) + V'( Va + Sip)) = 



(7.30) 



Expanding it to second order in the perturbations, and prior to making any 
simplifications we have that 



(<p„ + 3H<p )(l + A{4> 2 )) + V' + -V"'{S<p 2 ) - 2{(j>5ip) 



-4(<p6<p) - 6H(<p5<p) + Aip 



:(cpV 2 5 V ) = 



(7.31) 



As usual, the spatial derivatives can be neglected for LF perturbations. 

Now, the equation above indicate clearly that the background scalar field 
is corrected by some source terms proportional, roughly speaking, to (</> 2 ). A 
correction term to the scalar field must be included, if we are not to hit upon 
inconsistencies. This correction to the homogeneous value of the scalar field can 
be cast in the form 



<p (t) — > <p (t) + e 2 v(t) 



(7.32) 



which is precisely the zero mode of the second-order variables r m that we men- 
tioned in the 5 th chapter. Of course, the metric must also be altered [see Eqs. 
( |5.53j )], and the simplest ansatz is to change the zero modes by 



i(t) — > a(t)[l + ( 2 w(t)\ 



(7.33) 



The terms from Eq. (p.llh are much smaller than the ones shown also by a factor of 1/V„. 



which implies that 
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H(t) — > H(t) + e 2 h(t) 



(7.34) 



where h(t) = w(t). 

These corrections must be included so that the second-order Einstein's equa- 
tions can in fact be solv ed perturbatively. Doing these substitutions and after 
using the relations (7.2C) we have, to second order, 



T^.J 2 > =ip (v + 3Hv + 3h<p + m 2 v - 2m 2 ip (<f) 2 )) = . (7.35) 

Notice that the last term on th e RHS of the above expression is precisely the 
extra term found in Eq. ( 7.29| ). The refor e there is nothing wrong with the 



continuity equation calculated in Eq. ( 7.29| ), except that we did not allow for 



the degrees of freedom from back reaction: if we include those, as we should, 
then then the continuity equations imply the equation of motion above. 

In order to close the picture, the Einstein equations must also accomodate 
the degrees of freedom from back reaction. We have, to second order in the 
perturbations and in the back reaction variables, 



6Hh = ip v + m 2 <p v — 3p ((j) 2 } , (7.36) 
-6Hh-2h = tp v-m 2 (p v + (8(p 2 -~3p )((l) 2 } , (7.37) 

to lowest order in small parameters like H / H 2 <C 1 and l/vv 2 <C 1. 

It can be verified that the integrability conditions for the system of equations 
(fzliel)- (p^) is given by the corrected equation of motion ( |7.35|) for the back 
reaction v(t) on the scalar field ip (t). The LHS of (7.36) and (7.37 should really 



be interpreted as the effective second-order dynamical corrections to the energy 
density and the pressure of the background. Of course, to answer what these 
corrections are one should solve the coupled equations above for the variables 
v (t) and h(t) in terms of <fi (t) (the background metric is already in terms of 

Let us conclude the proof that there are well-behaved solutions v(t) and h(t) 
to the system of equations abov e. We have already shown that the continuity 
equations for the RHS of ( [7.36 ) and ( 7.37 ) imply the equations of motion for 
the back reaction on the scalar field, ( 7.35 ). Now, we prove that the Bianchi 
identities for the back reaction metric variable h = w on the RHS are satisfied 
independently of the continuity equations. Taking the the expression G^ v .^ for 



the metric ds 2 — dt 2 
obtain 



2 (t)[l — 2w(t)]dx 2 and expanding it to second order we 



dt 



{6Hh) + 3H{~2h) + 3h(~2H) = 



(7.38) 
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where we remember that the connection term 3H — > 3H + 3ft. when we perform 
the covariant derivatives. The Bianchi identities are therefore satisfied for the 
back reaction variable h = w. 

Summarizing, we found that the effective energy density of the effective 
energy-momentum tensor of long wavelength scalar perturbations is negative, 
and counteracts the energy density of the background. It grows with time, and 
becomes more important as inflation proceeds. In addition, the equations of 
motion obeyed by the variables expressing the back reaction of the perturbations 
onto the background are completely consistent (they obey the Bianchi identities 
and the conservation equations.) 

Of course, in order to give final answers to the questions about the dynamics 
of the back reaction corrections, if they actually change the back groun d or i f 
this effect eventually vanishes, we must solve the s ystem equations ( 7.36 )-( 7.37 ) 
complemented by the equation of motion ( [7.35 . This will the subject of a 
forthcoming publication p3. 



7.4 Conclusions and Discussion 

We have analysed the problem of how perturbations can effect the background in 
which they propagate. The object that expresses this nonlinear feedback effect is 
the effective energy-momentum tensor (EEMT) of perturbations. Fundamental 
problems related to the gauge dependence of the EEMT have been addressed, 
and the resolution lies in keeping track of all second-order terms coming from the 
gauge transformations. We found that upon a coordinate transformation that 
redefines the perturbations, the background variables themselves must be rede- 
fined due to second-order zero modes coming from the gauge transformations. 
Based on these observations we have been able to set up a gauge-independent 
back reaction formalism. 

We applied that formalism to the problems of back reaction of gravitational 
radiation and scalar perturbations in cosmological spacetimes, and found that 
the EEMT of long wavelength fluctuations has a negative energy density, cor- 
responding to a slowdown of the expansion. In the case of gravitational waves, 
after including correction terms to the pressure that should be accounted for 
if the time dependence of the EEMT is to be consistent with the continuity 
equations, the equation of state becomes p gw = —p gw /3. 

For scalar perturbations, we deduced a consistent set of equations that ex- 
press the back reaction on the background. The equation of state of the EEMT 
is that of a negative cosmological constant, p s — —p s . This corresponds to an 
instability of de Sitter space to long wavelength perturbations, an effect which 



has already been noticed in Ref. 58 



Our back reaction formalism is purely classical in contrast to the quantum 
effects discussed in Refs. |6(| pT| . However, the source of the fluctuations which 
we consider is quantum mechanical (perturbative quantum gravity), and in this 
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sense back reaction may be able to probe quantum gravitational effects. For 
the spectrum of perturbations predicted in inflationary cosmologies our back 
reaction effect is dominated by the infrared, a result similar to the one found 
in the analysis of Ref. [^9| , where the effect of tensorial quantum gravitational 
fluctuations which overpopulate the infrared was calculated. 

In the context of a chaotic inflationary universe scenario, back reaction effects 
can become very important to the dynamics of the background before the end 
of inflation, if inflation began in that region with high enough initial values of 
the scalar field. If this initial value is considered to be the self-reproducing scale 
f sr = m^ 1 / 2 , then our conclusion is that back reaction must be taken into 
account before the time when the scalar field reaches the value tpb r — mT 1 / 3, . 

On a more speculative note, back reaction might also provide a dynamical 
relaxation mechanism whereby a preexisting cosmological constant is cancelled. 
The cosmological constant is hundreds of orders of magnitude smaller than the 
value suggested from dimensional analysis or even from supersymmetry, which 
indicates that some mechanism, very likely a cosmological one, is acting to 
preclude its existence from showing up in our observations. We have seen how 
the back reaction from scalar perturbations have the effect of counteracting the 
effective cosmological constant of the scalar field potential. In that picture, 
inflation ends when the value of the energy density in the background scalar 
field is balanced by the energy density coming of the EEMT of long wavelength 
perturbations. Analogously, we speculate that back reaction will act to relax the 
cosmological constant of the universe to an effective value which is comparable 
to the energy density in ordinary matter (a conservative bound given the present 
observational data.) This would solve one of the longest-standing problems of 
theoretical physics. 
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Appendix A 

De Witt's conventions 



Here we describe the conventions used for functional operations and functional 
derivatives of tensor fields, initially described in Chapter 4 and used throughout 
in this monograph. 

Consider tensor fields w^.'.'.'(x <T ) (w(x) for short) which are defined in the 
manifold M. with respect to the coordinate chart x" '. We say that w is a 
tensor functional of other tensor fields f (x a ) if w(x) can be expressed as 
a local function of f(x) and its derivatives df. Of course, general covariance 
implies that generic tensorial expressions should also involve the metric g and 
its derivatives dg in the covariant derivatives of /. We collect the "variables" / 
and g in the following way: 



i m {x) = [f$t(x)Jtt(x),C:::(x), ■ ■ ■,/J£v:(*),soo(aO,sbi(aO, ■ ■ ■ )533 (a0] > 

(A.l) 

where m is a "field" index running from 1 to the total number of degrees of 
freedom in / and g, provided that degrees of freedom are not counted twice (for 
example, we include goi but not gio = goi-) 

The functional derivative of w(x) with respect to q m (x') is defined in terms 
of the infinitesimal variation q m (x') + 5q m (x'), 

8w(x) w[q n (x)+Sq n (x)]-w[q n (x)} 



Sq m (x') Sq^(x')^a Sq m (x') 
where it is understood that 

W$j = ^-^ < A - 3 > 

for two independent degrees of freedom /" and f m . 

A variation of the functional w under a fixed variation Sq m (x) of the variable 
q m can be expressed as 
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Sw(x) 



I 



Sw(x) 



■ 5q m {x')dx' 



(A.4) 



8q m {x') 



Clearly, from the formula above, Sw(x)/8q m (x') is an operator acting on the 
variation Sq m (x). For example, suppose that w is the Ricci tensor R^ Vl and the 
q m are just the metric. We have in this case the differential operator 



acting on the fixed variation 8g a p{x'). After operating on 5g and integrating the 
result, we obtain what is commonly called the linearization of R^ v with respect 
to the metric perturbation 5g a p(x). 

To facilitate the discussions, we have adopted De Witt's condensed notation^ 
which assumes that the field index m includes the continuous coordinates x as 
well, so for example q m (x') = q m . We also use an extension of Einstein's con- 
ventions that whenever field indices appear twice they should be summed over 
and the expression should be integrated, so that for example the contraction 
and integration of the tensors /„„ and g^ in this notation reads 



5R^{x) 
&g a p{x') 



8(x - x') [S^V(x')\7 a (x') + V^zOV^O^V) (A.5) 
-^V ff (a/)V„(z')<7 a V) ~ ^V ff OOV M (aO<rV)] 




(A.6) 



Finally, for functional derivatives we use the short-hand notation 




(A.7) 



where m now is a dummy index, w iTn idq m 



w^dq 1 
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Appendix B 



Transformation Law for the 
Ricci Tensor 



Here we prove some useful formulas connected with the Lie derivatives of rie- 
mannian objects. 

B.l Metric and connections 

Consider a metric that underwent a gauge transformation af — x^ + £ M (x) , 

9\iv Q\iv ("^-(jS)^ 9p.v 9^v,a£, 9a^£,,v 9av£, t ^ > (-^.1) 

and the contravariant components transform as 

~ gfl U = _ {Cigr = ^ _ + 9 ^ + gOVQ (B 2) 

It can be easily verified that (f^g^v = + 0(£ 2 ), and we remind the reader 
that £ is a "small" vector in the sense that g^ — g^ — 0(e). 

Under a gauge transformation the connections T transform by the following 
formula, obtained after a bit of algebra: 

per -per -per £a . pa £cr per £a per /-a 

lit* i±i> ^i/,q^ "1" ^i/S,a! au^.^i afi^.v 

= T- v - (L £ r)° v - ^ . ' ' (B.3) 

We have used the definition of the co nnec tion, — \g aX {gx^.u + g\v,n — g^u.x) 



and substituted into it expressions (B.l) and (B.2) for the metric in the trans- 



formed coordinate frame x (we neglect all 0(£ 2 ) terms here 
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B.2 Ricci tensor 



We want to demonstrate now the identity L^R^ v [g] — R^ LV , m {C^g) r 
consider the Ricci tensor calculated with metric (B.l), 



First, 



+ r^[(£ £ r)^ + ^] -r^ Q [(£ £ r)^ + ^] . 

At this point it is useful to derive the following identity for the Lie derivative 
of the connections, 

[(£ s r)«J a = - r^Q a + r^ CT + v^ va (b.s) 

where the first term on the RHS is understood to be the Lie derivative of r^„ <7 . 



Using this formula into expression (B.4) we get that 



We must compare now the expression (B.6) with the result of the Lie deriva- 
tive of the Ricci tensor, 



y£(;R) fiu — Rfiv,a£, Rafi£,,v ~t~ Rotu^L 



(B.7) 



where i? is given in the first line of (B.4) with T instead of F. It is easy to check 
that all the terms on (B.6) cancel exactly those in (SOI), therefore 



{(I - £t) ■ RU[g] = R^{(1 ~ £ s ) ■ g] 



(B.6 



where 1 is the identity. This completes the proof that, to first order, the differ- 
ential operator 1 — £^ acting on the functional R[g] is equal to the functional R 
evaluated on the "o pera ted" variables (1 — C^)g. 

Expanding Eq. (B.S) in powers of £ we have that, to first order, 



CtR\g] = R, m ■ 



(B.9) 



which is the desired result. 
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